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Commissioner for Patents 
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Dear Sir: 

I, J. Christopher Grimaldi, declare and state as follows: 

1 . I am a Senior Research Associate in the Molecular Biology Department of 
Genentech, Inc., South San Francisco, CA 94080. 

2. My scientific Curriculum Vitae, including my list of publications, is attached to 
and forms part of this Declaration (Exhibit A). 

3. I joined Genentech in January of 1999. From 1999 to 2003, 1 directed the Cloning 
Laboratory in the Molecular Biology Department. During this time I directed or performed 
numerous molecular biology techniques including semi-quantitative Polymerase Chain Reaction 
(PCR) analyses. I am. currently involved, among other projects, in the isolation of genes coding 
for membrane associated proteins which can be used as targets for antibody therapeutics against 
cancer. In connection with the above- identified patent application, I personally performed or 
directed the semi-quantitative PCR gene expression analyses in the assay entitled "Tumor Versus 
Normal Differential Tissue Expression Distribution," which is described in EXAMPLE 18 in, the 
specification. These studies were used to identify differences in gene expression between tumor 
tissue and their normal counterparts. 

4. EXAMPLE 18 reports the results of the PCR analyses conducted as part of the 
investigating of several newly discovered DNA sequences. This process included developing 
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primers and analyzing expression of the DNA sequences of interest in normal and tumor tissues. 
The analyses were designed to determine whether a difference exists between gene expression in 
normal tissues as compared to tumor in the same tissue type. 

5. The DNA libraries used in the gene expression studies were made from pooled 
samples of normal and of tumor tissues. Data from pooled samples is more likely to be accurate 
than data obtained from a sample from a single individual. That is, the detection of variations in 
gene expression is likely to represent a more generally relevant condition when pooled samples 
from normal tissues are compared with pooled samples from tumors in the same tissue type. 

6. In differential gene expression studies, one looks for genes whose expression levels 
differ significantly under different conditions, for example, in normal versus diseased tissue. 
Thus, I conducted a semi-quantitative analysis of the expression of the DNA sequences of 
interest in normal versus tumor tissues. Expression levels were graded according to a scale of +, - 
, and +/- to indicate the amount of the specific signal detected. Using the widely accepted 
technique of PCR, it was determined whether the polynucleotides tested were more highly 
expressed, less expressed, or whether expression remained the same in tumor tissue as compared 
to its normal counterpart. Because this technique relies on the visual detection of ethidium 
bromide staining of PCR products on agarose gels, it is reasonable to assume that any detectable 
differences seen between two samples will represent at least a two fold difference in cDNA. 

7. The results of the gene expression studies indicate that the genes of interest can be 
used to differentiate tumor from normal. The precise levels of gene expression are irrelevant; 
what matters is that there is a relative difference in expression between normal tissue and tumor 
tissue. The precise type of tumor is also irrelevant; again, the assay was designed to indicate 
whether a difference exists between normal tissue and tumor tissue of the same type. If a 
difference is detected, this indicates that the gene and its corresponding polypeptide and 
antibodies against the polypeptide are useful for diagnostic purposes, to screen samples to 
differentiate between normal and tumor. Additional studies can then be conducted if further 
information is desired. 

8. I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information or belief are believed to be true, and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful statements may jeopardize the validity of the application or any 
patent issued thereon. 
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J. Christopher Grimaldi 



1434-36* Ave. 

San Francisco, CA 94122 

(415) 681-1639 (Home) 



EDUCATION 



University of California, Berkeley 
Bachelor of Arts in Molecular Biology, 1984 



EMPLOYMENT EXPERIENCE 



SRA 



Genentech Inc., South San Francisco; 1/99 to present 



Previously, was responsible to direct and manage the Cloning Lab. Currently focused on 
isolating cancer specific genes for the Tumor Antigen (TAP), and Secreted Tumor Protein 
(STOP) projects for the Oncology Department as well as Immunologically relevant genes for the 
Immunology Department. Directed a lab of 6 scientists focused on a company-wide team effort 
to identify and isolate secreted proteins for potential therapeutic use (SPDI). For the SPDI projec 
my duties were, among other things, the critically important coordination of the cloning of 
thousands of putative genes, by developing a smooth process of communication between the 
Bioinformatics, Cloning, Sequencing, and Legal teams. Collaborated with several groups to 
discover novel genes through the Curagen project, a unique differential display methodology. 
Interacted extensively with the Legal team providing essential data needed for filing patents on 
novel genes discovered through the SPDI, TAP and Curagen projects. My group has developed, 
implemented and patented high throughput cloning methodologies that have proven to be 
essential for the isolation of hundreds of novel genes for the SPDI, TAP and Curagen projects as 
well as dozens of other smaller projects. 



Scientist DNAX Research Institute, Palo Alto; 9/91 to 1/99 

Involved in multiple paageets aimed at understanding novel genes discovered through 
bioinformatics studies and functional assays. Developed and patented a method for the specific 
depletion of eosinophils in vivo using monoclonal antibodies. Developed and implemented 
essential technical methodologies and provided strategic direction in the areas of expression, 
cloning, protein purification, general molecular biology, and monoclonal antibody production. 
Trained and supervised numerous technical staff. 



Directed plant-related activities, which included expansion planning, maintenance, safety, 
purchasing, inventory control, shipping and receiving, and laboratory management. Designed 
and implemented the safety program. Also served as liaison to regulatory agencies at the local, 
state and federal level. Was in charge of property leases, leasehold improvements, etc. 
Negotiated vendor contracts and directed the purchasing department. Trained and supervised 
personnel to carry out the above-mentioned duties. 



Facilities 
Manager 



Corixa, Redwood City; 5/89 - 7/91. 



SRA University of California, San Francisco 

Cancer Research Institute; 2/87-4/89. 



Was responsible for numerous cloning projects including: studies of somatic hypermutation, 
studies of AIDS-associated lymphomas, and cloning of t(5; 14), t(l 1;14), and t(8;14) 
translocations. Focused on the activation of hemopoietic growth factors involved in the t(5;14) 
translocation in leukemia patients- 
Research 

Technician Berlex Biosciences, South San Francisco; 7/85-2/87. 

Worked on a subunit porcine vaccine directed against Mycoplasma hyopneumoniae. Was 
responsible for generating genomic libraries, sheening with degenerate oligonucleotides, and 
characterizing and expressing clones in E. coli. Also constructed a general purpose expression 
vector for use by other scientific teams. 

PUBLICATIONS 

1. Hilary F. Clark, et al. "The Secreted Protein Discovery Initiative (SPDI), a Large-scale 
Effort to Identify Novel Human Secreted and Transmembrane Proteins: a bioinformatics 
assessment." Genome Res. Vol 13(10), 2265-2270, 2003 

2* Sean H. Adams, Clarissa Chui' Sarah L. Schilbach, Xing Xian Yu, Audrey D. Goddard, J. 
Christopher Grimaldi, James Lee, Patrick Dowd, David A. Lewin, & Steven Colman "BFTT, 
a Unique Acyl-CoA Thioesterase Induced in Thermogenic Brown Adiopose Tissue: Cloning, 
organization of the humanb gene and assessment of a potential link to obesity" Biochemical 
Journal, Vol 360, 135-142, 2001 

3. Szeto W, Jiang W, Tice DA, Rubinfeld B, Hollingshead PG, Fong SE, Dugger DL, Pham T, 
Yansura D, Wong TA, Grimaldi JC, Corpuz RT, Singh JS, Frantz GD, , Devaux B, Crowley 
CW, SphwallgHJEberh^rd DA.itastelli L, Pokkis.P, and Rennica D. "Overexpression of 
the Retenoic Acid-Responsive Gene Stra6 in Human Cancers and its Synergistic Activation 
by Wnt-1 and Retinoic Acid." Cancer Research Vol. 61(10), 4197-4205, 2001 

4. Jeanne Kahn, Fuad Mehraban, Gladdys Ingle, Xiaohua Xin, Juliet E. Bryant, Gordon Vehar, 
Jill Schoenfeld, J. Christopher Grimaldi (incorrectly named as "Grimaldi, CJ"), Franklin 
Peale, Apama Draksharapu, David A. Lewin, and Mary E. Gerritsen. "Gene Expression 
Profiling in an in Vitro Model of Angiogenesis." American Journal of Pathology Vol 156(6), 
1887-1900,2000. 

5. Grimaldi JC, Yu NX, Grunig G, Seymour BW, Cottrez F, Robinson DS, Hosken N, Ferlin 
WG, Wu X, Soto H, O'Garra A, Howard MC, Coffman RL. "Depletion of eosinophils in 
mice through the use of antibodies specific for C-C chemokine receptor 3 (CCR3). Journal of 
Leukocyte Biology; Vol. 65(6), 846-53, 1999 

6. Oliver AM, Grimaldi JC, Howard MC, Kearney JF. "Independently ligating CD38 and Fc 
gammaRIIB relays a dominant negative signal to B cells." Hybridoma Vol. 18(2), 1 13-9, 
1999 



o 



o 



7. Cockayne DA, Muchamuel T, Grimaldi JC, MuUer-Steffner H, Randall TD, Lund FE, 
Murray R Schuber F, Howard MC. "Mice deficient for the ecto-nicotinamide adenine 
dinucleotide glycohydrolase CD38 exhibit altered humoral immune responses " Blood Vol 
92(4), 1324-33, 1998 

8. Frances E. Lund, Nanette W. Solvason, Michael P. Cooke, Andrew W. Heath, J. Christopher 
Grimaldi, Troy D. Randall, R. M. E. Parkhouse, Christopher C Goodnow and Maureen C. 
Howard. "Signaling through murine CD38 is impaired in antigen receptor unresponsive B 
cells." European Journal of Immunology, Vol. 25(5), 1338-1345, 1995 

9. M. J. Guimaraes, J. F. Bazan, A. Zolotnik, M. V. Wiles, J. C. Grimaldi, F. Lee, T. 
McClanahan. "A new approach to the study of haematopoietic development in the yolk sac 
and embryoid body." Development, Vol. 121(10), 3335-3346, 1995 

10. J. Christopher Grimaldi, Sriram Balasujjramanian, J. Fernando Bazan, Armen Shanafelt, 
Gerard Zurawski and Maureen Howard. "CD38-mediated protein ribosylation." Journal of* 
Immunology, Vol. 155(2), 811-817, 1995 

11. Leopoldo Santos- Argumedo, Frances F. Lund, Andrew W. Heath, Nanette Solvason, Wei 
Wei Wu, J. Christopher Grimaldi, R. M. E. Parkhouse and Maureen Howard. "CD38 
unresponsiveness of xid B cells implicates Braton's tyrosine kinase (btk) as a regulator of 
CD38 induced signal transduction," International Immunology, Vol 7(2), 163-170, 1995 

12. Frances Lund, Nanette Solvason, J. Christopher Grimaldi, R. M. E. Parkhouse and Maureen 
Howard. "Murine CD38: An immunoregulatory ectoenzyme." Immunology Todav. Vol 
16(10), 469-473, 1995 

13. Maureen Howard, J. Christopher Grimaldi, J. Fernando Bazan, Frances E. Lund, Leopoldo 
Santos- Argumedo, R. M. E. Parkhouse, Timothy F. Walseth, and Hon Cheung Lee. 
"Formation and Hydrolysis of Cyclic ADP-Ribose Catalyzed by Lymphocyte Antigen 
CD38." Science, Vol. 262, 1056-1059, 1993 

14. NobuyukiJHatada, LeopoldQ r $antos-Arguraedo, Ray Chang, J. Christopher Grimaldi, Frances 
Lund, Camilynn I. Brannan, Neal G. Copeland, Nancy A. Jenkins, Andrew Heath, R. M. E. 
Parkhouse and Maureen Howard. "Expression Cloning of a cDNA Encoding a Novel Murine 
B Cell Activation Marker: Homology to Human CD38." The Journal of Immunology, Vol 
151,3111-3118,1993 

15. David J . Rawlings, Douglas C. Saffran, Satoshi Tsukada, David A. Largaespada, J. 
Christopher Grimaldi, Lucie Cohen Randolph N. Mohr, J. Fernando Bazan, Maureen 
Howard, Neal G. Copeland, Nancy A. Jenkins, Owen Witte. "Mutation of Unique Region of 
Bruton's Tyrosine Kinase in Immunodeficient XID Mice." Science, Vol. 261, 358-360, 1993 

16. J. Christopher Grimaldi, Raul Torres, Christine A. Kozak, Ray Chang, Edward Clark, 
Maureen Howard, and Debra A. Cockayne. "Genomic Structure and Chromosomal Mapping 
of the Murine CD40 Gene. " The Journal of Immunology, Vol 149, 3921-3926, 1992 

17. Timothy C. Meeker, Bruce Shiramizu, Lawrence Kaplan, Brian Herndier, Henry Sanchez, J. 
Christopher Grimaldi, James Baumgartner, Jacab Rachlin, Ellen Feigal, Mark Rosenblum and 
Michael S. McGrath. "Evidence for Molecular Subtypes of HIV- Associated Lymphoma: 



o o 



Division into Peripheral Monoclonal, Polyclonal and Central Nervous System Lymphoma " 
AIDS, Vol. 5, 669-674, 1991 

18. Ann Grimaldi and Chris Grimaldi. "Small-Scale Lambda DNA Prep." Contribution to 
Current Protocols in Molecular Biology, Supplement 5, Winter 1989 

19. J. Christopher Grimaldi, Timothy C. Meeker. "The t(5;14) Chromosomal Translocation in a 
Case of Acute Lymphocytic Leukemia Joins the Interleukin-3 Gene to the Immunoglobulin 
Heavy Chain Gene." Blood, Vol. 73, 2081-2085, 1989 

20. Timothy C. Meeker, J. Christopher Grimaldi, et al. "An Additional Breakpoint Region in the 
BCL-1 Locus Associated with the t(ll;14) (ql3;q32) Translocation of B-Lymphocytic 
Malignancy." Blood, Vol. 74, 1801-1806, 1989 

21 Timothy C. Meeker, J. Christopher Grimaldi, Robert CRourke, et al. "Lack of Detectable 
Somatic Hypermutation in the V Region of the Ig H Chain Gene of a Human , Chronic B 
Lymphocytic Leukemia." The Journal of Immunoiogy, Vol". 141, 3994-3998, 1988 

MANUSCRIPTS IN PREPARATION 

1 . Sriram Balasubramanian, J. Christopher Grimaldi, J. Fernando Bazan, Gerard Zurawski and 
Maureen Howard. "Structural and functional characterization of CD38: Identification of 
active site residues" 



PATENTS 

1. "Methods for Eosinophil Depletion with Antibody to CCR3 Receptor" (US 6,207,155 Bl). 

2. "Amplification Based Cloning Method." (US 6,607,899) 

3. Ashkenazi et al., "Secreted and Transmembrane Polypeptides and Nucleic Acids Encoding 
the Same,." (this, patent coders several hundred genes) 

4. "IL-17 Homologous Polypeptides and Therapeutic Uses Thereof 

5. "Method of Diagnosing and Treating Cartilaginous Disorders." 
MEMBERSHIPS AND ACTIVITIES 

Editor Frontiers in Bioscience 



Member 



DNAX Safety Committee 1991-1999 

Biological Safety Affairs Forum (BSAF) 1990-1991 

Environmental Law Foundation (ELF) 1990-1991 
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DECLARATION OF J. CHRISTOPHER GRIMALDL UNDER 37 CF.R. § 1.132 

Commissioner for Patents 

P.O. Box 1450 

Alexandria, VA 22313-1450 



I, J. Christopher Grimaldi, declare and say as follows: 

1 . I am a Senior Research Associate in the Molecular Biology Department of 
Genentech, Inc., South San Francisco, CA 94080. 

2. I joined Genentech in January of 1999. From 1999 to 2003, 1 directed the Cloning 
Laboratory in the Molecular Biology Department. During this time I directed or performed 
numerous molecular biology techniques including qualitative Polymerase Chain Reaction (PCR) 
analyses. I am currently involved in, among other projects, the isolation of genes coding for 
membrane associated proteins which can be used as targets for antibody therapeutics against 
cancer. In connection with the above-identified patent application, I personally performed or 
directed the semi-quantitative PCR analyses in the assay entitled "Tumor Versus Normal 
Differential Tissue Expression Distribution" which is described in EXAMPLE 1 8 in the 
specification that were used to identify differences in gene expression between tumor tissue and 
their normal counterparts. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to 
and forms part of this Declaration (Exhibit A). 

4. In differential gene expression studies, one looks for genes whose expression levels 
differ significantly under different conditions, for example, in normal versus diseased tissue. 
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Chromosomal aberrations, such as gene amplification, and chromosomal translocations are 
important markers of specific types of cancer and lead to the aberrant expression of specific 
genes and their encoded polypeptides, including over-expression and under-expression. For 
example, gene amplification is a process in which specific regions of a chromosome are 
duplicated, thus creating multiple copies of certain genes that normally exist as a single copy. 
Gene under-expression can occur when a gene is not transcribed into mRNA. In addition, 
chromosomal translocations occur when two different chromosomes break and are rejoined to 
each other chromosome resulting in a chimeric chromosome which displays a different expression 
pattern relative to the parent chromosomes. Amplification of certain genes such as Her2/Neu 
[Singleton et at., Pathol. Annu. , 27Ptl:165-190], or chromosomal translocations such as t(5;14), 
fGrimaldi et al, Blood , 73(8):2081-2085(1989); Meeker et qL, Blood , 76(2):285-289(1990)] give 
cancer cells a growth or survival advantage relative to normal cells, and might also provide a 
mechanism of tumor cell resistance to chemotherapy or radiotherapy. When the chromosomal 
aberration results in the aberrant expression of a mRNA and the corresponding gene product (the 
polypeptide), as it does in the aforementioned cases, the gene product is a promising target for 
cancer therapy, for example, by the therapeutic antibody approach. 

5 . Comparison of gene expression levels in normal versus diseased tissue has 
important implications both diagnostically and therapeutically. For example, those who work in 
this field are well aware that in the vast majority of cases, when a gene is over-expressed, as 
evidenced by an increased production of mRNA, the gene product or polypeptide will also be 
over-expressed. It is unlikely that one identifies increased mRNA expression without associated 
increased protein expression. This same principle applies to gene under-expression. When a 
gene is under-expressed, the gene product is also likely to be under-expressed. Stated in another 
way, two cell samples which have differing mRNA concentrations for a specific gene are 
expected to have correspondingly different concentration of protein for that gene. Techniques 
used to detect mRNA, such as Northern Blotting, Differential Display, in situ hybridization, 
quantitative PCR, Taqman, and more recently Microarray technology all rely on the dogma that a 
change in mRNA will represent a similar change in protein. If this dogma did not hold true then 
these techniques would have little value and not be so widely used. The use of mRNA 
quantitation techniques have identified a seemingly endless number of genes which are 
differentially expressed in various tissues and these genes have subsequently been shown to have 
correspondingly similar changes in their protein levels. Thus, the detection of increased mRNA 
expression is expected to result in increased polypeptide expression, and the detection of 
decreased mRNA expression is expected to result in decreased polypeptide expression. The 
detection of increased or decreased polypeptide expression can be used for cancer diagnosis and 
treatment. 

6. However, even in the rare case where the protein expression does not correlate 
with the mRNA expression, this still provides significant information useful for cancer diagnosis 
and treatment. For example, if over- or under-expression of a gene product does not correlate 
with over- or under-expression of mRNA in certain tumor types but does so in others, then 
identification of both gene expression and protein expression enables more accurate tumor 
classification and hence better determination of suitable therapy. In addition, absence of over- or 



AppLNo. : 10/063,557 

Filed : May 2, 2002 



wider-expression of the gene product in the presence of a particular over- or under-expression of 
mRNA is crucial information for the practicing clinician. For example, if a gene is over-expressed 
but the corresponding gene product is not significantly over-expressed, the clinician accordingly 
will decide jaot to treat a patient with agents that target that gene product. 



7. I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information or belief are believed to be true, and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful statements may jeopardize the validity of the application or any 
patent issued thereon. 



By: 



S:\DOCS\AOK> 
071904 





EXHIBIT A 



0 



o 



J. Christopher Grimaldi 

1434-36* Ave. 

San Francisco, CA 94122 

(415) 681-1639 (Home) 

EDUCATION University of California, Berkeley 

Bachelor of Arts in Molecular Biology, 1984 



EMPLOYMENT EXPERIENCE 



SR A Genentech Inc., South San Francisco; 1/99 to present 

Previously, was responsible to direct and manage the Cloning Lab. €urrently focused on 
isolating cancer specific genes for the Tumor Antigen (TAP), and Secreted Tumor Protein 
(STOP) projects for the Oncology Department as well as Immunologically relevant genes for the 
Immunology Department. Directed a lab of 6 scientists focused on a company-wide team effort 
to identify and isolate secreted proteins for potential therapeutic use (SPDI). For the SPDI project 
my duties were, among other things, the critically important coordination of the cloning of 
thousands of putative genes, by developing a smooth process of communication between the 
Bioinformatics, Cloning, Sequencing, and Legal teams. Collaborated with several groups to 
discover novel genes through the Curagen project, a unique differential display methodology. 
Interacted extensively with the Legal team providing essential data needed for filing patents on 
novel genes discovered through the SPDI, TAP and Curagen projects. My group has developed, 
implemented and patented high throughput cloning methodologies that have proven to be 
essential for the isolation of hundreds of novel genes for the SPDI, TAP and Curagen projects as 
well as dozens of other smaller projects. 

Scientist DNAX Research Institute, Palo Alto; 9/91 to 1/99 

< Involved in multiple projects* aimed ateunderstanding novel genes discovered through 
bioinformatics studies and functional assays. Developed and patented a method for the specific 
depletion of eosinophils in vivo using monoclonal antibodies. Developed and implemented 
essential technical methodologies and provided strategic direction in the areas of expression, 
cloning, protein purification, general molecular biology, and monoclonal antibody production. 
Trained and supervised numerous technical staff . 

Facilities 

Manager Corixa, Redwood City; 5/89 -7/91. 

Directed plant-related activities, which included expansion planning, maintenance, safety, 
purchasing, inventory control, shipping and receiving, and laboratory management. Designed 
and implemented the safety program. Also served as liaison to regulatory agencies at the local, 
state and federal level. Was in charge of property leases, leasehold improvements, etc. 
Negotiated vendor contracts and directed the purchasing department. Trained and supervised 
personnel to carry out the above-mentioned duties. 



o 
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SRA University of California, San Francisco 

Cancer Research Institute; 2/87-4/89. 

Was responsible for numerous cloning projects including: studies of somatic hypermutation, 
studies of AEDS-associated lymphomas, and cloning of t(5; 14), t(l 1 ; 14), and t(8; 14) 
translocations. Focused on the activation of hemopoietic growth factors involved in the t(5;14) 
translocation in leukemia patients.. 

Research 

Technician Berlex Biosciences, South San Francisco; 7/85-2/87. 

Worked on a subunit porcine vaccine directed against Mycoplasma hyopneumoniae. Was 
responsive igr generating genomic Ubrariejs, screening with degenerate oligonucleotides, and 
characterizing and* expressing clones in E. coli. Also constructed a general purpose expression 
vector for use by other scientific teams. 

PUBLICATIONS 

1. Hilary F. Clark, et al. "The Secreted Protein Discovery Initiative (SPDI), a Large-scale 
Effort to Identify Novel Human Secreted and Transmembrane Proteins: a bioinformatics 
assessment." Genome Res. Vol 13(10), 2265-2270, 2003 

2. Sean H. Adams, Clarissa Chui' Sarah L. Schilbach, Xing Xian Yu, Audrey D. Goddard, J. 
Christopher Grimaldi, James Lee, Patrick Dowd, David A. Lewin, & Steven Colman* "BFIT, 
a Unique Acyl-CoA Thioesterase Induced in Thermogenic Brown Adiopose Tissue: Cloning, 
organization of the humanb gene and assessment of a potential link to obesity" Biochemical 
Journal, Vol 360, 135-142, 2001 

3. Szeto W, Jiang W, Tice DA, Rubinfeld B, Hollingshead PG, Fong SE, Dugger DL, Pham T, 
Yansura D, Wong TA, Grimaldi JC, Corpuz RT, Singh JS, Frantz GD, , Devaux B, Crowley 
CW,.Schwall RH, Eberbard DA*£a&telli L, PolakikP, and Pennica D. "Overexpression of— 
the Retenoic Acid-Responsive Gene Stra6 in Human Cancers and its Synergistic Activation 
by Wnt-1 and Retinoic Acid." Cancer Research Vol. 61(10), 4197-4205, 2001 

4. Jeanne Kahn, Fuad Mehraban, Gladdys Ingle, Xiaohua Xin, Juliet E. Bryant, Gordon Vehar, 
Jill Schoenfeld, J. Christopher Grimaldi (incorrectly named as "Grimaldi, CJ"), Franklin 
Peale, Aparna Draksharapu, David A. Lewin, and Mary E. Gerritsen. "Gene Expression 
Profiling in an in Vitro Model of Angiogenesis." American Journal of Pathology Vol 156(6), 
1887-1900,2000. 

5. Grimaldi JC, Yu NX, Grunig G, Seymour BW, Cottrez F, Robinson DS, Hosken N, Ferlin 
WG, Wu X, Soto H, O'Garra A, Howard MC, Coffman RL. "Depletion of eosinophils in 
mice through the use of antibodies specific for C-C chemokine receptor 3 (CCR3). Journal of 
Leukocyte Biology; Vol. 65(6), 846-53, 1999 

6. Oliver AM, Grimaldi JC, Howard MC, Kearney JF. "Independently ligating CD38 and Fc 
gammaRIIB relays a dominant negative signal to B cells." Hybridoma Vol. 18(2), 1 13-9, 
1999 



o 
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7. Cockayne DA, Muchamuel T, Grimaldi JC, Muller-Steffher H, Randall TD, Lund FE, 
Murray R, Schuber F, Howard MC. 'Mice deficient for the ecto-nicotinamide adenine 
dinucleotide glycohydrolase CD38 exhibit altered humoral immune responses." Blood Vol. 
92(4), 1324-33, 1998 

8. Frances E. Lund, Nanette W. Solvason, Michael P. Cooke, Andrew W. Heath, J: Christopher 
Grimaldi, Troy D. Randall, R. M. E. Parkhouse, Christopher C Goodnow and Maureen C. 
Howard. "Signaling through murine CD38 is impaired in antigen receptor unresponsive B 
cells." European Journal of Immunology, Vol. 25(5), 1338-1345, 1995 

9. M. J. Guimaraes, J. F. Bazan, A. Zolotnik, M. V. Wiles, J. C. Grimaldi, F. Lee, T. 
McClanahan. "A new approach to the study of haematopoietic development in the yolk sac 
and embryoid body." Development, Vol. 121(10), 3335-3346, 1995 

1Q. h Christopher Grimaldi, Sriram Balasubramanian, J. Fepiando Bazan, Armen Shanafelt, 
Gerard Ziirawski and Maureen Howard. "CD38-mediated protein ribosylation." Journal of 
Immunology, VoL 155(2), 811-8 17, 1995 

1 1. Leopoldo Santos- Argumedo, Frances F. Lund, Andrew W. Heath, Nanette Solvason, Wei 
Wei Wu, J. Christopher Grimaldi, R. M. E. Parkhouse and Maureen Howard. "CD38 
unresponsiveness of xid B cells implicates Bruton's tyrosine kinase (btk) as a regulator of 
CD38 induced signal transduction," International Immunology, Vol 7(2), 163-170, 1995 

12. Frances Lund, Nanette Solvason, J. Christopher Grimaldi, R. M. E. Parkhouse and Maureen 
Howard. "Murine CD38: An immunoregulatory ectoenzyme." Immunology Today, Vol. 
16(10), 469-473, 1995 

13. Maureen Howard, J. Christopher Grimaldi, J. Fernando Bazan, Frances E. Lund, Leopoldo 
Santos- Argumedo, R. M. E. Parkhouse, Timothy F. Walseth, and Hon Cheung Lee. 
"Formation and Hydrolysis of Cyclic ADP-Ribose Catalyzed by Lymphocyte Antigen 
CD38." Science, Vol. 262, 1056-1059, 1993 

14. Nobuyuki Harada, l^polda?Santos-Argumedo^ Ray Chang, J. Christopher Grimaldi, ^Frances 
Lund, Camilynn I. Brannan, Neal G. Copeland, Nancy A. Jenkins, Andrew Heath, R. M. E. 
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The t(5;14) Chromosomal Translocation in a Case of Acute Lymphocytic 
Leukemia Joins the Interleukin-3 Gene to the Immunoglobulin Heavy Chain Gene 

By J. Christopher Grimaldi and Timothy C. Meeker 



Chromosomal translocations have proven to be important 
markers of the genetic abnormalities central to the patho- 
genesis of cancer. By cloning chromosomal breakpoints 
one can identify activated proto-oncogenes. We have stud- 
ied a case of B-Kneage acute lymphocytic leukemia (ALL) 
that was associated with peripheral blood eosinophilia. The 
chromosomal translocation t(5;14) (q31;q32) from this 
sample was cloned and studied at the molecular level. This 

KARYOTYPIC STUDIES of leukemia and lymphoma 
have identified frequent nonrandom chromosomal 
translocations. Some of these translocations juxtapose the 
immunoglobulin heavy chain (IgH) gene with important 
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Fig 1. DNA blots of the leukemia sample. The restriction 
fragment pattern of normal human DNA <N) and the leukemia 
sample (L) were compared using a human Jh probe. Rearranged 
bands are indicated by arrows. Sample L exhibits a single rear- 
ranged band with both Hind Hl/fcoRI and SaU3A restriction 
digests. The rearranged bands are less intense than the other 
bands because the majority of cells In the sample represent normal 
bone marrow elements. 



translocation joined the immunoglobulin heavy chain join- 
ing (Jh) region to the promotor region of the interleukin-3 
(IL-3) gene in opposite transcriptional orientations. The 
data suggest that activation of the IL-3 gene by the 
enhancer of the immunoglobulin heavy chain gene may play 
a central role in the pathogenesis of this leukemia and the 
associated eosinophilia. 
e 1 989 by Grune & Stratton, Inc. 

protooncogenes, such as c-myc and 6c/-2. w In this way, the 
IgH gene can activate proto-oncogenes, resulting in disor- 
dered gene expression and a step in the development of 
cancer. The investigation of additional nonrandom transloca- 
tions into the IgH locus allows us to identify new genes 
promoting the generation, of leukemia and lymphoma. 

A distinct subtype of acute lymphocytic leukemia (ALL) 
has been characterized by B-lineage phenotype, associated 
eosinophilia in the peripheral blood, and a t(5;14)(q31;q32) 
chromosomal translocation. 3,4 This syndrome probably 
occurs in <1% of all patients with ALL. We hypothesized 
that the cloning of the translocation characteristic of this 
leukemia might allow the identification of an important gene 
on chromosome 5 that plays a role in the evolution of this 
disease. In this report we demonstrate that the interleukin-3 
gene (IL-3) and the IgH gene are joined by this transloca- 
tion. 

MATERIALS AND METHODS 

Sample and DNA blots, A bone marrow aspirate from a repre- 
sentative patient with ALL (LI morphology by French-American- 
British [FAB] criteria), peripheral eosinophilia (up to 20,000 per 
microliter with a normal value of <350 per microliter) and a 
t(5;14)(q31;q32) translocation was studied. Using published meth- 
ods, genomic DNA was isolated and DNA blots were made. 5 Briefly, 
10 Mg of high molecular weight (mo! wt) DNA were digested using 
an appropriate restriction enzyme and electrophoresed on a 0.8% 
agarose gel; The gel was stained with ethidium bromide, photo- 
graphed, denatured, neutralized, and transferred to Hybond (Amer- 
sham, Arlington Heights, IL). After treatment of the filter with 
ultraviolet light, hybridization was performed. The filter was washed 
to a final stringency of 0.2% saturated sodium citrate (SSC) and 
0.1% sodium lauryly sulfate (SDS) and exposed to film. The human 
Jh probe has been previously reported. 6 

Genomic library. The genomic library was made using pub- 
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lishcd methods. 5 Approximately 100 fig of high mol wt genomic 
DNA were partially digested with the SaulA restriction enzyme. 
Fragments from 9 to 23 kilobases (kb) in size were isolated on a 
sucrose gradient and ligated into phage EMBL3A (Strategene, San 
Diego). Recombinant phage were packaged, plated, and screened as 
previously reported. 5 

DNA sequencing. Fragments for sequencing were cloned into 
Ml 3 vectors and sequenced by the chain termination method using 
Sequenase (United States Biochemical, Cleveland). 7 All sequence 
data were derived from both strands. 

RESULTS 

We studied a bone marrow sample from a patient with 
ALL and associated peripheral eosinophilia. Karyotypic 
analysis showed the characteristic t(5;14)(q31;q32) translo- 
cation. These features define a distinctive subtype of ALL. 3,4 
The leukemic cells were analyzed for cell surface phenotype 
by immunofluorecence. They were positive for Bl (CD20), 
B£ ; (CD19), cALLA T (CD 1 0), HLA-DR, and terminal 
deoxynucleotidyl transferase (Tdt), but negative for surface 
immunoglobulin. This phenotypic profile describes an imma- 
ture cell from the B-lymphocytic lineage. 8 

The leukemia DNA was analyzed by Southern blotting for 
rearrangements of the IgH gene. Using a human immuno- 
globulin Jh probe, a single rearranged band was detected by 
EcoRl, Hindlll Sstl, Sau3A y and EcoRl plus Hindlll 
restriction digests, suggesting rearrangement of one allele 
(Fig 1). The immunoglobulin Jh region from the other allele 
was presumably either deleted or in the germline configura- 
tion. 

We hypothesized that the t(5;l4)(q31;q32) juxtaposed a 



growth-promoting gene on chromosome 5 with the immuno- 
globulin Jh region on chromosome 14. Therefore, a genomic 
library was made from the leukemic sample and screened 
with a Jh probe. Fifteen distinct positive clones were isolated 
and screened for the presence of the rearranged SaulA 
fragment that was detected by DNA blotting. By this 
analysis, five clones appeared to represent the rearranged 
allele identified by DNA blots. One of these clones (clone no. 
4) was chosen for further study and a detailed restriction 
map was generated. The EcoRl Hindlll/ EcoRl and Sstl 
fragments from clone no. 4 that hybridized to the human Jh 
probe were also identical in size to the rearranged fragments 
from the leukemia sample, confirming that clone no. 4 
represented the rearranged leukemic allele. 

Phage clone no. 4 contained 3.7 kb of unknown origin 
joined to the IgH gene in the region of Jh4 (Fig 2). The IgH 
gene from Jh4 to the Cmu region appeared to be in germline 
configuration. Previously,- the gene encoding hematopoietic 
growth factor IL-3 had been mapped to chromosome 5q3 1 so 
it was suspected that clone no. 4 might contain part of this 
gene. 9 ' 12 When the restriction map of human IL-3 and clone 
no. 4 were compared, they were identical for more than 3 kb 
(Fig 2). 

We confirmed the juxtaposition of the IL-3 gene and the 
IgH gene by nucleic acid sequencing of the subcloned 
BstEll/Hpal fragment (Fig 2). The sequence of this frag- 
ment showed no disruption of the protein coding region or the 
messenger RNA of the IL-3 gene. The break in the IL-3 gene 
occurred in the promoter region, 452 base pairs (bp) 
upstream of the transcriptional start site (position 64, Fig 
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3A). The break in the IgH gene occurred 2 bp upstream of 
the Jh4 region. Between the two breaks, 25 bp of uncertain 
origin (putative N sequence) were inserted. 13,14 No sequences 
homologous to the immunoglobulin heptamer and nonamer 
could be identified in the IL-3 sequence (Fig 3B). Therefore, 
nucleic acid sequencing confirmed the juxtaposition of the 
IL-3 gene and the IgH gene. The sequence data clearly 
showed that the genes were positioned in opposite transcrip- 
tional orientations (head-to-head). 

Available data also allowed us to determine the normal 
positions of the IL-3 gene and the GM-CSF gene in relation 
to the centromere of chromosome 5 (Fig 4). The IgH gene is 
known to be positioned with the variable regions toward the 
telomere on chromosome 14q. u5 It has also been shown that 



2083 



GM-CSF maps within 9 kb of IL-3 in the same transcrip- 
tional orientation. 16 Using this information and assuming a 
simple translocation event in our sample, we can conclude 
that the IL-3 gene is normally more centromeric, and the 
GM-CSF gene more telomeric on chromosome 5q (Fig 4). 
Furthermore, both are transcribed with their 5' ends toward 
the centromere. 

DISCUSSION 

In this report we have cloned a unique chromosomal 
translocation that appears to be a consistent feature of a rare, 
yet distinct, clinical form of acute leukemia. This transloca- 
tion joined the promotor of the IL-3 gene to the IgH gene. 
Except for the altered promotor, the IL-3 gene appeared 
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Rg 4. Diagram of the translocation. The normal chromosome 
6q31 is shown with the GM-CSF gene telomedc to the IL-3 gene in 
the transcriptional orientation shown. On norma! chromosome 
14q32 the Vh regions are telomeric. The t(6;14)(q31 ;q32) translo- 
cation results in the head-to-head orientation of these genes. 
Symbols are defined in Fig 2, BP, breakpoint position. 



intact as no deletions, insertions, or point mutations were 
detected by restriction mapping of the entire gene and 
sequencing of part of the gene. The IgH gene has been 
truncated at the Jh4 region, which places the immunoglobu- 
lin enhancer within 2.5 kb of the IL-3 gene. 1718 This leads to 
the hypothesis that the enhancer is increasing transcription 
of a structurally normal IL-3 gene. The same mechanism is 
important for activation of the c-myc gene in some cases of 
Burkitt's lymphoma. 19 An alternate hypothesis is that the 
elimination of an upstream IL-3 promotor element is crucial 
to the activation of the IL-3 gene. 

The proposed activation of the IL-3 gene suggests that an 
autocrine loop is important for the pathogenesis of this 
leukemia. Over-expression of the IL-3 gene coupled with 
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the presence of the IL-3 receptor in these cells could account 
for a strong stimulus for proliferation. In this regard, there 
are data indicating that immature B-lineage lymphocytes 
and B-lineage leukemias may express the IL-3 receptor. 21 - 22 

An additional feature of this type of leukemia is the 
dramatic eosinophilia, consisting of mature forms. It has 
been hypothesized that the eosinophils do not arise from the 
malignant clone, but are stimulated by the tumor. 23 * 24 
Because of the known effect of IL-3 on eosinophil differentia- 
tion, secretion of high levels of IL-3 by leukemic cells might 
have a role in the eosinophilia in this type of leukemia. 12 

The data suggest that the recombination mechanism that 
is active in the IgH gene during normal differentiation has a 
role in this translocation. 13,14 This is supported by the break- 
point location at the 5' end of Jh4 and the presence of 
putative N-region sequences. On the other hand, no recombi- 
nation signal sequence (heptamer andjionamer) was found 
in this r region on chromosome 5, suggesting that additional 
factors also played a role. Further studies will elucidate the 
mechanism of this and other translocations. 

In the leukemia we studied, it is possible that the immuno- 
globulin enhancer also activates the GM-CSF gene, since 
this gene is probably positioned only 14 kb away (Fig 4). This 
is known to be within the range of enhancer activation. 25 The 
interleukin-5 (IL-5) gene maps to chromosome 5q31. 26 
Deregulation of the IL-5 gene by this translocation would act 
synergistically with IL-3 in the stimulation of eosinophil 
proliferation and differentiation. 27 These and other questions 
will be answered by the study of more patient samples: We 
plan to determine whether the t(5;14)(q31;q32) transloca- 
tion is capable of activating multiple lymphokines simulta- 
neously and whether they cooperate in the generation of this 
leukemia. 
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RAPID COMMUNICATION 

Activation of the InterIeukin-3 Gene by Chromosome Translocation in Acute 

Lymphocytic Leukemia With Eosinophilia 

By Timothy C. Meeker, Dan Hardy, Cheryl Willman, Thomas Hogan, and John Abrams 



The t(5;14)(q31;q32) translocation from IMineage acute 
lymphocytic leukemia with eosinophilia has been cloned 
from two leukemia samples. In both cases, this transloca- 
tion Joined the IgH gene and the interleukin-3 (IL-3) gene. In 
one patient, excess IL-3 mRNA was produced by the 
leukemic cells. In the second patient, serum IL-3 levels 
were measured and shown to correlate with disease 

A NUMBER OF chromosome translocations have been 
associated with human leukemia and lymphoma. In 
many cases the study of these translocations has led to the 
discovery or characterization of proto-oncogenes, such as 
bcl-2 t c-flW, and omyc, that are located Sdjacent to the 
translocation. 1 * 2 It is now widely understood that cancer- 
associated translocations disrupt nearby proto-oncogenes. 

A distinct subtype of acute leukemia is characterized by 
the triad of B-lineage immunophenotype, eosinophilia, and 
the t(5;14)(q31;q32) translocation. 3,4 Leukemic cells from 
such patients have been positive for terminal deoxynucleotidyl 
transferase (Tdt), common acute lymphoblastic leukemia 
antigen (CALLA), and CD 19, but negative for surface or 
cytoplasmic immunoglobulin. In previous work, we cloned 
the t(5;14) breakpoint from one leukemic sample (Case 1) 
and determined that the IgH and interleukin-3 (IL-3) genes 
were joined by this abnormality. 5 In this report, we extend 
those findings by showing that the t(5;14)(q31;q32) translo- 
cation from a second leukemia sample (Case 2) has a similar 
structure, and we report our study of growth factor expres- 
sion in these patients. 

MATERIALS AND METHODS 

Samples and Southern blots. Case 1 has been described. 5 * 6 
Clinical features of Case 2 have' been described in detail. 3 DNA 
isolation and Southern blotting was done using previously described 
methods. 3 Filters were hybridized with an immunoglobulin Jn probe, 
a 280 bp BamHl/EcoKl genomic IL-3 fragment, and an IL-3 
cDNA probe. 7 -* 

Northern blots. RNA isolation and Northern blotting have been 
described. 9 Briefly, Northern blots were done by separating 9^g 
total RNA on 1% agarose-formaldehyde gels. Equal RNA loading in 
each lane was confirmed by ethidium bromide staining. Blots were 
hybridized with an IL-3 cDN A probe extending to the Xho I site in 
exon 5, a 720 bp Sst l/Kpn I probe derived from intron 2 of the IL-3 
gene, a 600 bp Nhe l/Hpa I IL-5 cDNA probe, and a 500 bp Pst 
l/Nco I granulocyte-macrophage colony stimulating factor (GM- 
CSF) cDNA probe. 10 " 12 

Polymerase chain reaction. Primers were designed with BamHl 
sites for cloning. One primer hybridized to the Jh sequences from the 
IgH gene (Primer 1 44 : 5'-TAGG ATCCG ACGOTG ACCAGGGT) , 
and the other hybridized to the region of the TATA box in the IL-3 
gene (Primer 161: 5'-AACAGGATCCCGCCTTATATGTGCAG). 
Polymerase chain reaction (PCR) (95°C for 1 minute, 61°C for 30 
seconds, and 72°C for 3 minutes) was done using 500 ng genomic 
DNA and 50 pmol of each primer in 100 uL containing 67 mmol/L 
Tris-HCl pH 8.8, 6.7 mmol/L MgCl 2 , 10% dimethyl sulfoxide 
(DMSO), 170 Mg/mL bovine serum albumin (BSA) (fraction V), 



activity. There was no evidence of excess granulocyte/ 
macrophage colony stimulating factor (GM-CSF) or IL-5 
expression. Our data support the formulation that this 
subtype of leukemia may arise in part because of a 
chromosome translocation that activates the IL-3 gene, 
resulting in autocrine and paracrine growth effects. 
© 1990 by The American Society of Hematology. 

16.6 mmol/L ammonium sulfate, 1 .5 mmol/L each dNTP and Taq 
polymerase (Perkin-Elmer, Norwalk, CT). 13 

Sequencing. Sequencing was done by chain termination in M 1 3 
vectors. 14 As part of this study, we sequenced a subclone of a normal 
1Lr3 promoter, covering 598 bas$j>airsJrom a Sma I site at position 
- 1240 (wtttrrespect to the proposed site of transcription initiation) 
to an Nhe I site at position —642. The plasmid containing this region 
was a gift from Naoko Arai of the DNAX Research Institute. 

Expression in Cos? cells. A genomic IL-3 fragment from Case 1 
was cloned into the pXM expression vector. 10 Briefly, the Hindlll/ 
Sal I fragment containing the IL-3 gene was subcloned from the 
previously .described phage clone 4 into pUC18. 5 The 2.6 kb 
fragment extending from the Sma I site 61 bp upstream of the IL-3 
transcription start to the Sma I site in the polylinker was cloned into 
the blunted Xho I site of pXM. The negative control construct was 
the pXM vector without insert. Plasmids were introduced into Cos 7 
cells by electroporation, and supernatant was collected after 48 
hours in culture. 

TF1 bioassay. TF-1 cells were passaged in RPMI 1640 supple- 
mented with 10% heat-inactivated fetal bovine serum, 2 mmol 
L-glutamine, and 1 ng/mL human GM-CSF. 15 Samples and antibod- 
ies were diluted in this same medium lacking GM-CSF but contain- 
ing penicillin and streptomycin. A 25 mL volume of serial dilutions of 
patient serum was added to wells in a flat bottom 96-well microtiter 
plate. Rat anti-cytokine monoclonal antibody in a volume of 25 $iL 
was added to appropriate wells and preincubated for 1 hour at 37°C. 
Fifty microliters of twice washed TF-1 cells were added to each well, 
. giving a final cell^ concentration of 1 x 10 4 cells per well (final 
volume, 100 uh). The plate was incubated for 48 hours. The 
remaining cell viability was determined metabolically by the colori- 
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Fig 1. Breakpoint sequences for Case 2. The germline IgJhB region sequence (protein coding region and recombination filarial 
sequences are underlined) is on top, the translocation sequence from Case 2 (PCR primer sequences and putative N region are undo r adl 
is in the middle, and the germline IL-3 sequence, which we derived from a normal IL-3 done, is on the bottom. 7 + indicates that " eh 
sequence has the same nucleotide. The sequence documents the head-to-head Joining of the IL-3 and IgH genes The braalrnoin* i» *k ^To 
gene occurred at position -934 (•). •wpwm in tne il-3 



Clone 



IL3 



metric method of Mosmann using a VMax microtiter plate reader 
(Molecular Devices, Menlo Park, CA) set at 570 and 650 nm." 

Cytokine immunoassays. These assays used rat monoclonal 
anti-cytokine antibodies (10 ng/mh) to coat the wells of a PVC 
microliter plate. The capture antibodies used were -BVD3-6G8, 
JES1-30D1O, and BVD2-23B6, for the IL-3, IL-5, and GM-CSf~ 
assays, respectively. Patient sera were then added (undiluted and 
diluted 1:2 for IL-3, undiluted for IL-5, and undiluted and diluted 
1:5 for GM-CSF). The detecting immunoreagents used were either 
mouse antiserum to IL-3 or nitroiodophenyl (NlP)-derivatized rat 
monoclonal antibodies JESI-5A2 and BVD2-21C11, specific for 
IL-5 and GM-CSF, respectively. Bound antibody was subsequently 
detected with immunoperoxidase conjugates: horseradish peroxidase 
(HRP)-labeled goat anti-mouse Ig for IL-3, or HRP-labeled rat (J4 
MoAb) anti-NIP for IL-5 and GM-CSF. The chromogenic sub- 
strate was 3-3'azino-bis-benzthiazoline sulfonate (ABTS; Sigma, St 
Louis, MO). Unknown values were interpolated from standard 
curves prepared from dilutions of the recombinant factors using 
Softmax software available with the VMAX microplate reader 
(Molecular Devices). 



RESULTS 

Leukemic DNA from Case 2 was studied by Southern 
blotting. When digested with the tfiwilll restriction enzyme 
and hybridized with a human immunoglobulin heavy chain 
joining region (Jh) probe, a rearranged fragment at approxi- 
mately 14 kb was detected (data not shown), Wheff reprobed 
with either of two different IL-3 probes, a rearranged 14 kb 



fragment, emigrating with the rearranged Jh fragment, was 
identified. When leukemic DNA was digested with Kindlll 
plus £coRI, a rearranged Jh fragment was detected at 6 kb. 
The IL-3 proles also identified a comigrating fragment of 
this size. These experiments indicated that the leukemic 
sample studied was clonal and that a single fragment 
contained both Jh arid IL-3 sequences, suggesting a translo- 
cation had occurred. 

To characterize better the joining of the IL-3 gene and the 
immunoglobulin heavy chain (IgH) gene, the polymerase 
chain reaction (PCR) was used to clone the translocation. 13 
A Jh primer and an IL-3 primer were designed to produce an 
amplified product in the event of a head-to-head transloca- 
tion. While control DNA gave no PCR product, Case 2 DNA 
yielded a PCR-derived fragment of approximately 980 bp, 
which was cloned and sequenced. 

The DNA sequence of the translocation clone from Case 2 
confirmed the joining of the Jh region with the promotor of 
the IL-3 gene in a head-to-head configuration (Fig 1) 
Sequence analysis indicated that the breakpoint on chromo- 
some 14 was just upstream of the Jh5 coding region. The 
breakpoint on chromosome 5 occurred 934 bp upstream of 
the putative site of transcription initiation of the IL-3 gene. 
We also determined that a putative N sequence of 17 bp was 
'inserted between the chromosome 5 and chromosome 14 
sequences during the translocation event. 1 ™ Figure 2 shows 
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Fig t. Relationship of chromosome 5 breakpoints to the IL-3 gene. This figure shows the two cloned breakpoints (arrows) in relation to 
the normal IL-3 gene. 8 - 10 One breakpoint occurred at position -462 and the other at -934 (arrows). In both circumstances the 
translocations resulted in a head-to-head Joining of the IgH gene and the IL-3 gene, leaving the mRNA ami protein coding regions of the IL-3 
gene intact. Boxes denote the five IL-3 exons; restriction enzymes are (B) BamHI, (P) Pat I, (H) Hpa I, (E) EcoRI, and <X) Xho I 
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Fig 3. Documentation of IL-3 mRN A over-expression. A Northern blot was prepared and hybridized with a probe for IL-3. Lane 1 
contained RNA from unstimulated peripheral blood lymphocytes (PBL) as a negative control. Lane 2 contained RNA from PBL stimulated for 
4 hours whh concanavalian A (ConA), and lane 3 contained RNA from PBL stimulated with Con A for 48 hours. As in the positive control 
lanes (2 and 3). a 1 kb band was identified in the leukemic sample from Case 1 {lane 4, lower arrow), suggesting aberrant expression of the 
IL-3 gene, m addition, the leukemic sample showed over-expression of an unspiiced 2.9 kb IL-3 transcript (lane 4, upper arrow). We 
documented that this represented an unspiiced precursor of the mature 1 kb transcript by showing that this band hybridized to a probe 
front intron 2 of the IL-3 gene. A similar 2.9 kb band was detected WlSfflr?, suggesting that an IL-3 mRMA of this size Is sometimes 
detectable in normal mitogen-stimulated cells. Lane 6 through 10 represent RNA from six samples of B-lineage acute lymphocytic leukemia 
without the t{5;1 4) translocation, indicating that only the sample with the translocation exhibited IL-3 over-expression. Case 2 could not be 
analyzed by Northern blot because too few cells were available for study. 



the locations of the two cloned breakpoints in relation to the 
IL-3 gene. The two chromosome 5 breakpoints were sepa- 
rated by less than 500 bp. 

The genomic structure in Cases 1 and 2 suggested that a 
normal IL-3 gene product was over-expressed as a result of 
the altered promotor structure. This would predict that the 
IL-3 gene on the translocated chromosome was capable of 
making IL-3 protein. This prediction was tested by express- 
ing a genomic fragment from the translocated allele of Case 
1 containing all five IL-3 exons under the control of the SV40 
promotor/enhancer in the Cos7 cell line. Cell supernatants 
were studied in a proliferation assay using the factor depen- 
dent erythroleukemic cell line, TF-1. The supernatants 
derived from transfections using the vector plus insert 
supported TF-1 proliferation, while supernatants from trans- 
fections using the vector alone were negative in this assay 
"Cdata Abt shown). Furthermore, the biologic activity could be 
blocked by an antibody to human IL-3 (BVD3-6G8). This 
result showed that the translocated allele retained the ability 
to make IL-3 mRNA and protein. 

The level of expression of IL-3 mRNA in leukemic cells 
from Case 1 was assessed. Northern blotting showed that the 
mature IL-3 mRNA (approximately 1 kb) and a 2.9 kb 
unspiiced IL-3 mRNA were excessively produced by the 
leukemia (Fig 3). The 2.9 kb form of the mRNA is also 
present at low levels in normal peripheral blood T lympho- 
cytes after mitogen activation (Fig 3). Several B-lineage 
acute leukemia samples without the t(5;14) translocation 
had undetectable levels of IL-3 mRNA in these experiments. 
In addition, although genes for GM-CSF and IL-5 map close 
to the IL-3 gene and might have been deregulated by the 
translocation, no IL-5 or GM-CSF mRNA could be detected 
in the leukemic sample (data not shown). 19,20 

Three serum samples from Case 2 were assayed by 
immunoassay for levels of IL-3, GM-CSF, and IL-5 (Table 
1). Serum IL-3 could be detected and correlated with the 
clinical course. When the patient's leukemic cell burden was 



highest, the IL-3 level was highest. No serum GM-CSF or 
IL-5 could be detected. 

Since the IL-3 immunoassay measured only immunoreac- 
tive factor, we confimed that biologically active IL-3 was 
present by using the TF-1 bioassay. This bioassay can be 
rendered monospecific using appropriate neutralizing mono- 
clonal antibodies specific for IL-3, IL-5, or GM-CSF. We 
observed that sera from 1-16-84 and 3-14-84 contained TF-1 
stimulating activity that could be blocked with anti-IL-3 
MoAb (BVD3-6G8), but not with MoAbs to IL-5 (JES1- 
39D10) or GM-CSF (BVD2-23B6) (Fig 4; GM-CSF data 
not shown). The amount of neutralizable bioactivity in these 
two samples correlated very well with the difference in IL-3 
levels obtained by immunoassay for these samples. Further- 
more, the failure to block TF-1 proliferating activity with 
either^ftnti-IL-5 or anti-GM-CSF was consistent with^hSr. 
inability to measure these factors by immunoassay and 



Table 1 . Peripheral Blood Counts and Growth Factor Levels 
at Different Times in Case 2 

Sample Date 





11/16/83 


1/16/84 


3/14/84 


Peripheral blood counts [ce\\s/pL) 








WBC 


81.800 


116,500 


12,300 


Lymphoblasts 


0 


33,785 


0 


Eosinophils 


46,626 


73,080 


615 


Serum growth factor levels (pg/mL) 








IL-3 


' <444 


7,995 


1,051 


GM-CSF 


<16 


<15 


<16 


IL-5 


<60 


<50 


<60 



Peripheral blood counts from Case 2 at three different time points with 
the corresponcfing growth factor levels quantified by immunoassay. The 
patient received chemotherapy between 1/1 6/84 and 3/1 4/84 to lower 
his leukemic burden. 3 No serum samples were available for a similar 
analysis of Case 1. 

Abbreviation: WBC, white blood cells. 
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Fig 4. Bioassay of serum IL-3. Leukemic patient sera were tested for bioaethra lua and it r i_ .u« « « 
reclprocalof the dilution ia Indicted on the horbontal axl^ ~ , ."J proW9r8tion "*»»*• The 



indicated that these other myeloid growth factors were not 
detectably circulating in the serum of this patient. 

DISCUSSION 

In this report, we have extended our analysis of acute 
lymphocytic leukemia and eosinophilia associated with the 
t(5;14) translocation. In both cases we have studied, we have 
documented the joining of the IL-3 gene from chromosome 5 
to the IgH gene from chromosome 14. The breakpoints on 
chromosome 5 are within 500 bp of each other, suggesting 
that additional breakpoints will be clustered in a small region 
of the IL-3 promotor. The PCR assay we have developed will 
be useful in the screening of additional clinical samples for 
this abnormality. 

The finding of a disrupted IL-3 promotor associated with 
an otherwise normal IL-3 gene implied that this transloca- 
tion might lead to the over-expression of a normal IL-3 gene 
product. In this work, we have documented that this is tjue. 
In addition, neither GM-CSF nor IL-5 are over-expressed by 
the leukemic cells. Furthermore, in one patient, serum IL-3 
could be measured and correlated with disease activity. To 
our knowledge, this is the first measurement of human IL-3 
in serum and its association with a disease process. The 
measurement of serum IL-3 in this and other clinical settings 
may now be indicated . 



The finding of the IL-3 gene adjacent to a cancer- 
associated translocation breakpoint suggests that its activa- 
tion is important for oncogenesis. It is our thesis that an 
autocrine loop for IL-3 is important for the evolution of this 
leukemia. 21 The excessive IL-3 production that we have 
documented would be one feature of such an autocrine loop. 
The final proof of our thesis must await additional data. In 
particular, from the study of additional clinical samples, it 
will be necessary to document that the IL-3 receptor is 
present on the leukemic cells and that anti-IL-3 antibody 
decreases proliferation of the leukemia in vitro. 

An important aspect of this work is the suggestion of a 
therapeu tic approach for this disease. If an autocrine loop for 
IL-3 can be documented in this disease, attempts to lower 
circulating IL-3 levels or block the interaction of IL-3 with 
its receptor may prove useful. Because it is also possible that 
the eosinophilia in these patients is mediated by the para- 
crine effects of leukemia^erjiyed IUJ^sjmjlar interventions 
may improve this aspect of the disease. Antibodies or 
engineered ligands to accomplish these goals may soon be 
available. 
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Clinical and Pathologic Significance of the 
c-erbB-2 (HER-2/neu) Oncogene 

Timothy P. Singleton and John G. Strickler 



The c-erkB-2 oncogene was first shown to have clinical significance in 1987 by 
Slam on et al, TO who reported that c-erfcB-2 DNA amplification in breast carcino- 
mas correlated with decreased survival in patients with metastasis to axillary 
lymph nodes. Subsequent studies, however, of c-erfeB-2 activation in breast 
carcinoma reached conflicting conclusions about its clinical significance. This 
oncogene also has been reported to have clinical and pathologic implications in 
other neoplasms. Our review summarizes these various studies and examines 
the clinical relevance of c-eriB-2 activation, which has not been emphasized in 
recent reviews. 37 38 ' 65 The molecular biology of the c-erfcB-2 oncogene has been 
extensively reviewed 37 * 38 ' 55 and will be discussed only briefly here. 



> 
? 

V 



BACKGROUND 

Hie c-erbB-2 oncogene was discovered in the 1980s by three lines of investiga- 
tion. The neu oncogene was detected as a mutated transforming gene in 
neuroblastomas induced by ethylnitrosurea treatment of fetal rats. 8 ' 73 * 74 * 78 The c- 
er&B-2 was a human gene discovered by its homology to the retroviral gene v- 
er&B. 33 ' 49 ' 76 HEfl-2 was isolated by screening a human genomic DNA library for 
homology with v-er&B. 24 When the DNA sequences were determined subse- 
quently, c-erfcB-2, HER-2, and neu were found to represent the same gene. 
Recently, the c-er&B-2 oncogene also has been referred to as NGL. 

The c-erfoB-2 DNA is located on human chromosome 17q21 84JMB and codes 
for oerfcB-2 mBNA (4.6 kb), which translates c-erfrB-2 protein (p!85). This 
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protein is a normal component of cytoplasmic membranes. The c-er&B-2 
oncogene is homologous with, but not identical to, c-erbB-1, which is located | 
on chromosome 7 and codes for thcrcpidcnnal growth factor receptor. 8103 The c- | 
erbB-2 protein is a receptor on cell membranes and has intracellular tyrosine 
kinase activity and an extracellular binding domain. 2 - 105 Electron microscopy 
with a polyclonal antibody detects c-crfcB-2 immunoreactivity on cytoplasmic 
membranes of neoplasms, especially on microvilli and the non-villous outer cell 
membrane. 61 In normal cells, immunohistochemical reactivity for c-erfcB-2 is 
frequently present at the basolateral membrane or the cytoplasmic membrane's 
brush border. 2 ^ 62 

There is experimental evidence that c-erbB-2 protein may be involved in 
the pathogenesis of breast neoplasia. Overproduction of otherwise normal c- 
erbB~2 protein can transform a ceU line into a malignant phenotype. 25 Also, 
when die n&& oifi$Sgene containing an activating point mutation is placed in 
transgenic mice with a strong promoter for increased expression, the mice 
develop multiple independent mammary adenocarcinomas. 18 ' 63 In other experi- 
j ments, monoclonal antibodies against the net* protein inhibit the growth (in 

j.| :'; nude mice) of a new-transformed cell line, 26 - 26 and immunization of mice with 

j!j ; neu protein protects them from subsequent tumor challenge with the neu- 

• transformed cell line." Some authors have speculated that the use of antago- 

'i ') nists for the unknown ligand could be useful in future chemotherapy. 85 Further 

av 



review of this experimental evidence is beyond the scope of this article. 



{{ f The c-erfoB-2 activation most likely occurs at an early stage of neoplastic 

development. This hypothesis is supported by the presence of oerbB-2 activa- 



tion in both in situ and invasive breast carcinomas. In addition, studies of 

: H metastatic breast carcinomas usually demonstrate uniform c-eriB-2 activation 

j* U at multiple sites in the same patient, «.immu« although <M?r*B-2 activation has 

% 1 1 rarely been detected in metastatic lesions but not in the primary tumor. 67 09107 

\jt \ .i Even more rarely, c-eriB-2 DNA amplification has been detected in a primary 

jf: ) j breast carcinoma but not in its lymph node metastasis. 5 In patients who have 

\ly bilateral breast neoplasms, both lesions have similar patterns of c-erftB-2 activa- 

\l * i tion, but only a few such cases have been studied. 11 



MECHANISMS OF c-erf>B-2 ACTIVATION 



: . j ? '■} The most common mechanism of c-erfcB-2 activation is genomic DNA amplifica- 

tion, which almost always results in oveiproduction of c-erfcB-2 mRNA and 
protein. lT ' 34 - 65 ' 81 The c-erfeB-2 amplification may stabilize the overproduction of 
mRNA or protein through unknown mechanisms. Human breast carcinomas 
with c-er&B-2 amplification contain 2 to 40 times more c-erfoB-2 DNA 4 * 3 and 4 to 
128 times more c-eriB-2 mRNA 34 * 90 than found in normal tissue. Most human 
breast carcinomas with c-erfcB-2 amplification have 2 to 15 times more c-er[>B-2 
DNA. Tbmors with greater amplification tend to have greater overproduc- 
i| ? • tion. 17 ' 52 ' 65 The non-mammary neoplasms that have been studied tend to have 
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similar levels of c-er&B-2 amplification or overproduction relative to the corre- 
sponding normal tissue. 

The second most common mechanism of oer£B-2 activation is overproduc- 
tion of c-erbB-2 mRNA and protein without amplification of c-erfcB-2 DNA, 81 
The quantities of mRNA and protein usually are less than those in amplified 
cases and may approach the small quantities present in normal breast or other 
tissues. ",60,52 The c-erfoB-2 protein overproduction without mRNA overproduc- 
tion or DNA amplification has been described in a few human breast carcinoma 
cell lines. 47 

Other rare mechanisms of c-eriB-2 activation have been reported. Translo- 
cations involving the c-erbB-2 gene have been described in a few mammary and 
gastric carcinomas, although some reported cases may represent restriction 
fragment length poly morphisms or incomplete restriction enzyme digestions 
that mimic t^milocations. 31 '^ 84 ' 80 ' 108 A single point mutation in the transmem- 
brane portion of neu has been described in rat neuroblastomas induced by 
ethylnitrosurea. 9 ' 55 The mutated neu protein has increased tyrosine kinase activ- 
ity and aggregates at the cell membrane. l W* Although there has been specula- 
tion that some of the amplified c-erfcB-2 genes may contain point mutations, 46 
none has been detected in primary human neoplasms. 41 » s3 ' 81 



TECHNIQUES FOR DETECTING C-ert>B-2 ACTIVATION 
Detection of c-erbB-2 DNA Amplification 

Amplification of oerbB-2 DNA 1s usually detected by DNA dot blot or South- 
ern blot hybridization. In the dot blot method, the extracted DNA is placed 
directly on a nylon membrane and hybridized with a c-erfcB-2 DNA probe. In 
the Southern blot method, the extracted DNA is treated with a restriction 
enzyme, and the fragments are separated by electrophoresis, transferred to a 
nylon membrane, and hybridized with a c-erbB-2 DNA probe. In both tech- 
niques, c-erfeB-2 amplification is quantified by comparing the intensity (mea- 
sured by densitometry) of the hybridization bands from the sample with those 
from control tissue. 

Several technical problems may complicate the measurement of c-erfcB-2 
DNA amplification. First, the extracted tumor DNA may be excessively de- 
graded or diluted by DNA from stromal cells, 61 Second, the c-erbB-2 DNA 
probe must be carefully chosen and labeled. For example, oligonucleotide c- 
erfcB-2 probes may not be sensitive enough for measuring a low level of c-er&B- 
2 amplification, because diploid copy numbers can be difficult to detect (unpub- 
lished data). Third, the total amounts of DNA in the sample and control tissue 
must be compensated for, often with a probe to an unamplified gene. Many 
studies have used control probes to genes on chromosome 17, the location of c- 
erbB-2, to correct for possible alterations in chromosome number. Identical 
results, however, are obtained by using control probes to genes on other chro- 
mosomes, 5 ® 80 with rare exception. n Studies using control probes to the beta- 
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168 T.P. SINGLETON AND J.G. STRICKLER 

globin gene must be interpreted with caution, because one allele of this gene is 
deleted occasionally in breast carcinomas. 3 

Amplification of c-erfcB-2 DNA was assessed by using the polymerase 
chain reaction (PCR) in one recent study. 32 Oligoprimers for the c-erfeB-2 gene 
and a control gene are added to the sample's DNA, and PCR is performed. If 
the sample contains more copies of oerbh-2 DNA than of the control gene, the 
c-eriB-2 DNA is replicated preferentially. 



t;i sue. Although the sensitivity may be increased by modifications that allow 

simultaneous detection of c-erfcB-2 DNA and mRNA, in situ hybridization still 



is cumbersome and expensive (unpublished data). 
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Detection of c-erbB-2 mRNA Overproduction ')i 

Overproduction of c-erfcB-2 mRNA usually is measured by RNA dot blot or 
Northern blot hybridization. Both techniques require extraction of RNA but \ 
otherwise are analogous to DNA dot blot and Southern blot hybridization. Use j 
of' PCR for-detection of oe?&B& mRN£ has been described in < two recent | 
abs tracts. * m 

Overproduction of c-er&B-2 mRNA can be measured by in situ hybridiza- 
tion, Sections are mounted on glass slides, treated with protease, hybridized £ f 
with a radiolabeled probe, washed, treated with nuclease to remove unbound \j 
probe, and developed for autoradiography Silver grains are seen only over 
tumor cells that overproduce c-erfcB-2 mRNA. Negative control probes are 
used. 6598106 Our experience indicates that these techniques are relatively insensi- 
tive for detecting c-erfcB-2 mRNA overproduction in routinely processed tis-r 
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All of the above c-er frB-2 mRNA detection techniques have several prob- 
lems that make them more difficult to perform than techniques for detecting 
DNA amplification. One major problem is the rapid degradation of RNA in 
I \ tissue that is not immediately frozen or fixed. In addition, during the detection | • \ 

procedure, RNA can be degraded by RNase; a ubiquitous enzyme, which must 
be eliminated meticulously from laboratory solutions. Third, control probes to 
genes that are uniformly expressed in the tissue of interest need to be carefully 
% selected. — *.-rs-- ■ | J 
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Detection of c-erbB-2 Protein Overproduction i 

The most accurate methods for detecting c-erhB-2 protein overproduction are || ( 

the Western blot method and immunoprecipitation. Both techniques can docu- $ • 

ment the binding specificity of various antibodies against c-erfcB-2 protein. In | i 

Western blot studies, protein is extracted from the tissue, separated by electro- | J 

phoresis (according to size), transferred to a membrane, and detected by using an- | 

tibodies to c-er&B-2. In immunoprecipitation studies, antibodies against c-er&B- | j 

2 are added to a tumor lysate, and the resulting protein-antibody precipitate is | • 

jj : separated by gel electrophoresis and stained for protein. Both Western blot and | ; : 

immunoprecipitation are usefiil research tools but currently are not practical for | ; | 

diagnostic pathology. Two recent abstracts have described an enzyme-linked J ! 
immunosorbent assay (ELISA) for detection of c-$rfcB-2 protein. 
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Overproduction of c-erfcB-2 protein is most commonly assessed by various 
immunohistochemical techniques. These procedures often generate conflicting 
results, which are explained at least partially by three factors. First, various 
studies have used different polyclonal and monoclonal antibodies. Because 
some polyclonal antibodies recognize weak bands in addition to the c-erfcB-2 { 
protein band on Western blot or immunoprecipitation, the results of these 
studies should be interpreted with caution. 22 * 36 * 47 ' 61 Even some monoclonal anti- 
bodies immunoprecipitate protein bands in addition to c-erfcB-2 (plSS). 30 - 5986 
Second, tissue fixation contributes to variability between studies. For example, 
some antibodies detect c-erfeB-2 protein only in frozen tissue and do not react 
in fixed tissue. In general, formalin fixation diminishes the sensitivity of 
immunohistochemical methods and decreases the number of reactive cells. 8186 
When Jte^iinV fixative Js^ulredv'ther^ may be a higher ^percenta^e of positive 
cases. 22 Third, minima] criteria for interpreting immunohistochemical staining 
are generally lacking. Although there is general agreement that distinct crisp 
cytoplasmic membrane staining is diagnostic for c-erfcB-2 activation in breast 
carcinoma, the number of positive cells and the staining intensity required to 
diagnose c-erfeB-2 protein overproduction varies from study to study and from 
antibody to antibody. Degradation of c-erfeB-2 protein is not a problem because 
it can be detected in intact form more than 24 hours after tumor resection 
without fixation or freezing, 64 



ACTIVATION OF c-erbB-2 IN BREAST LESIONS 
Incidence of c-erftB-2 Activation 

Most studies of c-erfoB-2 oncogene activation do not specify histological sub- 
types of infiltrating breast carcinoma. Amplification of c-erfcB-2 DNA was found 
in 19.1 percent (519 of 2715) of invasive carcinomas in 25 studies (Rible 1), and 
c-erfcB-2 mRNA or protein overproduction was detected in 20.9 percent (566 of 
2714) of invasive carcinomas in 20 studies. Twelve studies have documented c- 
erfcB-2 mRNA or protein overproduction in 15 percent (88 of 604) of carcinomas 
that lacked c-erbB-2 DNA amplification. 

The incidence of c-erfeB^2 activation in infiltrating breast carcinoma varies 
with the histological subtype. Approximately 22 percent (142 of 650) of infiltrat- 
ing ductal carcinomas have c-erfeB-2 activation, as expected firom the above 
data. Other variants of breast carcinoma with frequent oeri?B-2 activation are 
inflammatory carcinoma (62 percent, 54 of 87), Paget's disease (82 percent, 9 of 
11), and medullary carcinoma (22 percent, 5 of 23). In contrast, c-erfcB-2 activa- 
tion is infrequent in infiltrating lobular carcinoma (7 percent, 5 of 73) and 
tubular Carcinoma (7 percent, 1 of 15). 

The c-erbB-2 protein overproduction is present in 44 percent (44 of 100) of 
ductal carcinomas in situ and especially comedocarcinoma in situ (68 percent, 
49 of 72). The micropapillary type of ductal carcinoma in situ also tends to have 
c-erfcB-2 activation, 40 * 54 * 68 especially if larger cells are present. The greater fre- 
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quency of c-er&B-2 protein overproduction in comedocarcinoma in situ, com- 
pared with infiltrating ductal carcinoma, could be explained by the fact that 
many infiltrating ductal carcinomas arise from other types of intraductal carci- 
noma, which show c-er&B-2 activation infrequently. Others have speculated 
that carcinoma in situ with c-ertB-2 activation tends to regress or to lose c- 
erbB-2 activation during progression to invasion, 4068 '* 2 Infiltrating and in situ 
components of ductal carcinoma, however, usually are similar with respect to c- 
erbB-2 activation, 1139 although some authors have noted more heterogeneity of 
the immunohistochemical staining pattern in invasive than in in situ carci- 
noma. Activation of c-eriB-2 is infrequent in lobular carcinoma in situ. If 
lesions contain more than one histological pattern of carcinoma in situ, the c- 
erfcB-2 protein overproduction tends to occur in the comedocarcinoma in situ 
*r ^ . but may include other areas of carcinoma Sn situ; 42 ^ 6 * Overproduction of c- 
erfeB-2 protein in ductal carcinoma in situ correlates with larger cell size and a 
periductal lymphoid infiltrate. 68 

Activation of c-erfcB-2 has not been identified in benign breast lesions, 
including fibrocystic disease, fibroadenomas, and radial scars (Table 2), Strong 
membrane immunohistochemical reactivity for oerfeB-2 has not been described 
in atypical ductal hyperplasia, although weak accentuation of membrane staining 
has been noted infrequently. 39 ^ 54 In normal breast tissue, c-erfcB-2 DNA is 
diploid, and c-er&B-2 is expressed at lower levels than in activated tumors. 34 - 35 65 88 

These preliminary data suggest that c-erfcB-2 activation may not be useful 
for resolving many of the common problems in diagnostic surgical pathology. For 
example, c-erfeB-2 activation is infrequent in tubular carcinoma and radial scars. 
In addition, because c-crfoB-2 activation is unusual in atypical ductal hyperplasia, 
cribriform carcinoma in situ, and papillary carcinoma in situ, detection of c-er&B- 
2 activation in these lesions may not be helpful in their differential diagnosis. The 
histological features of comedocarcinoma in situ, which commonly overproduces 
c-er&B-2, are unlikely to be mistaken for those of benign lesions. Activation of 

TABLE 2. c-efbB-3 ACTIVATION IN BENIGN MAN BREAST LESIONS 





e*er*B-2 DNA 


c-ertB-2 mRNA 


c-erbB-2 Protein 


Histological Diagnosis 


Amplification* 


Overproduction 


Overproduction 


Fibrocystic disease 


0/1 0* 




0/32W079 ( M CV8w 


Atypical ductal hyperplasia 






2(weak)/21 « 

1 (cytoplasmic)/^ 38 


Benign ductal hyperplasia 






0/12 3 * 


Sclerosing adenosis 






074» 


Fibroadenomas 


0/1 6 » 0/6" 
0/2," 0/1" 


0/6* 0/3» 


0/21 »> 0/1 0,w 
0/8»0/3«a 


Radial scars 






Q/22P 


Blunt dud adenosis 






0/1 4 s8 


"Breast mastosls" 




0/3» 





•Shown as number of cases with actlvattorVraimoer of cases studied; reference Is given as a superscript 
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c-erfeB-2, however, does favor infiltrating ductal carcinoma over infiltrating 
lobular-carcinoma. Further studies of these issues would be useful. 

Correlation of c-erbB-2 Activation With Pathologic Prognostic Factors 

Multiple studies have attempted to correlate c-erfcB-2 activation with various 
pathologic prognostic factors (Table 3). Activation of c-erbBr2 was correlated 
with lymph node metastasis in 8 of 28 series, with higher histological grade in 6 
of 17 series, and with higher stage in 4 of 14 series. Large tumor size was not 
associated with c-erfcB-2 activation in most studies (11 of 14). Tetraploid DNA 
content and low proliferation, measured by Ki-67, have been suggested as 
prognostic factors and may correlate with c-erfcB-2 activation. 6 * 7 

Correlation of c-er bB-2 Activation With Clinical Prognostic Factors 

Various studies have attempted also to correlate c-erfcB-2 activation with clinical 
features that may predict a poor outcome (Table 4). Activation of c-erbB-2 
correlated with absence of estrogen receptors in 10 of 28 series and with ab- 
sence of progesterone receptors in 6 of 18 series. In most studies, patient age 
did not correlate with c-erfcB-2 activation, and, in the rest of the reports, c- 
erbB-2 activation was associated with either younger or older ages. 

Correlation of c-erbB-2 Activation With Patient Outcome 

Slamon et al 78 - 8 * first showed that amplification of the c-eriB-2 oncogene inde- 
pendently predicts decreased survival of patients with breast carcinoma. The 
correlation of c-crbB-2 amplification with poor outcome was nearly as strong as 
the correlation of number of involved lymph nodes with poor outcome. Slamon 
et al also reported that c-erfcB-2 amplification is an important prognostic indica- 
tor only in patients with lymph node metastasis. 19 * 81 

A large number of subsequent studies also attempted to correlate c-eriB-2 
activation with prognosis (Table 5). In 12 series, there was a correlation be- 
tween c-erbB-2 activation and tumor recurrence or decreased survival. In five 
of these series, the predictive value of c-erfcB-2 activation was reported to be 
independent of other prognbstic factors. In contrast, 18 series did not confirm 
the correlation of c-erfcB-2 activation with recurrence or survival. Four possible 
explanations for this controversy are discussed below. 

One problem is that c-erfeB-2 amplification correlates with prognosis 
mainly in patients with lymph node metastasis. As summarized in Table 5, most 
studies of patients with axillary lymph node metastasis showed a correlation of 
c-erfcB-2 activation with poor outcome. In contrast, most studies of patients 
without axillary metastasis have not demonstrated a correlation with patient 
outcome. Table 6 summarizes the studies in which all patients (with and with- 
out axillary metastasis) were considered as one group. There is a trend for 
studies with a higher percentage of metastatic cases to show an association 
between c-erfoB-2 activation and poor outcome. Thus, most of the current 
evidence suggests that c-er&B-2 activation has prognostic value only in patients 
with metastasis to lymph nodes. 
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TABLE 6. CORRELATION OF C-erbM ACTIVATION WITH OUTCOME IN PATIENTS 
WITH BREAST CARCINOMA 



Number of Patients 



Type of 



With 
Metastases to 





c-erbB-2 
Activation* 


Totef 


Axillary No 
tymp/i Nodes Metesfas/s 


Statistical 
Analysts 0 


Reference 


<0 05 


DMA 


1 /O 




M 


87 




DMA 


ft1 

07 




i i 

u 


50 




DMA 


Of 




u 


65 


<0 05 


DMA 


•♦1 




1 1 

u 


93 


<0 05 


mRMA 


oz 




1 1 

u 


65 


< 0 05 




V.' 




kM 

M 


101 


<0 05 


DMA 




345 


M 


81 


<0 05 


DMA 




ion 


U 


17 


<0 05 


DMA 






1 1 

u 


87 


<0 05 


DMA 




ob 


ki 


79 


<0 05 


riuiviirvVD 






kM 


BE 

85 


<0 05 


Pmtftin 




CO 


1 1 
U 


101 


0.05-0.15 


DNA 


57 




u 


111 


0.05-0.15 


Protein 


189 




M 


92 


0.05-0.15 


Protein 




120 


U 


66 


>0.15 


DNA 


130 




U 


113 


>0.1S 


DNA 


122 




M 


4 


>0.15 


ONA 


50 




U 


44 


>0.15 


mRNA 


57 




U 


50 


>0.15 


Protein 


290 




M 


86 


>0,15 


Protein 


195 




U 


11 


>0.15 


Protein 


102 




U 


39 


>0.15 


Protein 




137 


u 


17 


>0.15 


DNA 




181 


M 


61 


>0.15 


DNA 


... — v' r - • - 


159 


' U 


17 


>0.15 


DNA 




73 


u 


* 7 


>0.15 


Proteln-WB 




378 


u 


85 


>0.15 


Protein- WB 




192 


u 


17 


>0,15 


Protein 




141 


u 


86 


>0.15 


Protein 




41 


u 


40 



•The endpotnts of these studies were tumor recurrence or decreased survival or both. Correlation between o- 
e/oB-2 activation end a poorer patient outcome Is statistically significant at <o,05, to of equivocal significance 
at 0.0S to 0.1 6, and Is not significant at >0. 15.. 

6 Shown as variable measured. Letters "WB" Indicate assay by Western blot; the other protein studies used 
fmmunohistochemlcai method!. 

C M - muHlvarlate statistical analysis; U = univariate statistical analysis. 
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TABLE 6. PERCENTAGE OF BREAST CARCINOMAS WITH METASTASIS COMPARED 
WfTH PROGNOSTIC SIGNIFICANCE OF ACTIVATION 



%of tumors with 
lymph node 
metastasis In 
each study 



70- 



60- 



50- 



40- 



71 (DNA)» 



61 <0NA)*> 

69 (DNA) 07 
58(Proteln)i<» 



64(DNA) 



42(Protein)« 



+ 



P<0.05 



+ 



0.05<P<0.15 



64 (mRNA) 50 
61(DNA)< 



68 (DNA) 44 
57 (DNA)i« 
55(PfOtelnj» 

46(Protein)n 
46(Proteln)» 



P>0:15 



P tor correlation of c-eroB-2 activation with patient outcome. 

Each study's percentage of breast carcinomas with metastasis is compared with the correlation between o- 
erfrB-2 activation and outcome. These data induce only (hose studies that considered, as one group, afl breast 
cancer patients, whether or not they had axHtery metastasis. Superscripts are the references. In parentheses 
are the types of c-eroB-2 activation. P values are Interpreted as In Tabte 3. 



A second problem is that various types of breast carcinoma are grouped 
together in many survival studies. Because the current literature suggests that 
c-erbb-2 activation is infrequent in lobular carcinoma, studies that combine 
infiltrating ductal and lobular carcinomas may dilute the prognostic effect of o 
eriB-2;activation iri ductal tumors. In addition, most studies do not analyze 
inflammatory breast carcinoma separately. This condition frequently shows c- 
erfcB-2 activation and has a worse prognosis than the usual mammary carci- 
noma, but it is an uncommon lesion. 

A third potential problem is the paucity of studies that attempt to correlate 
c-erfoB-2 activation with clinical outcome in subsets of breast carcinoma without 
metastasis. Two recent abstracts reported that in patients without lymph node 
metastasis who had various risk factors for recurrence (such as large tumor size 
and absence of estrogen receptors), c-erfcB-2 overexpression predicted early 
recurrence. 23 67 In patients with ductal carcinoma in situ, one small study found 
no association between tumor recurrence and c-erbB-2 activation. 40 

A fourth problem is the lack of data regarding whether the prognosis 
correlates better with c-erbB-2 DNA amplification or with mRNA or protein 
overproduction. Most studies that find a correlation between c-€rfeB-2 activa- 
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tion and poor patient outcome measure c-eriB-2 DNA amplification (Table 5), 
and breast carcinoma patients with greater amplification of c-erfcB-2 may have 
poorer survival. 79 - 81 Recent studies suggest that amplification has more prognos- 
tic power than overproduction,"^ DUt t ^ e clinical significance of c-eriB-2 
overproduction without DNA amplification deserves further research. 17 ^ Few 
studies have attempted to correlate patient outcome with c-erfeB-2 mRNA 
overproduction, and many studies of c-erfcB-2 protein overproduction use rela- 
tively less reliable mediods such as immunohistochemical studies with poly- 
clonal antibodies. 

Comparison of c-ertB-2 Activation With Other Oncogenes in 
Breast Carcinoma 

Other oncogenes that may haye prognostic implications in human breast cancer 
are reviewed elsewhere. 71 .* 06 This section will be restricted to a comparison 
between the clinical relevance of c-erfcB-2 and these other oncogenes. 

Hie c-myc gene is often activated in breast carcinomas, but c-nw/c activa- 
tion generally has less prognostic importance than c-erfcB-2 activation. w,04,77.«7.93 
One study found a correlation between increased mRNAs of c-erfeB-2 and c- 
tnyc, although other reports have not confirmed this, 34 - 106 Subsequent research, 
however, could demonstrate a subset of breast carcinomas in which c-myc has 
more prognostic importance than c-erfcB-2. 

The gene c-er&B-l for the epidermal growth iactor receptor (EGFR) is 
homologous with c-erbB-2 but is infrequently amplified in breast carcinomas. 79 
Overproduction of EGFR, however, occurs more frequently than amplification 
and may correlate with a poor prognosis. In studies that have examined both c- 
erfcB-2 and EGFR in the same tumor, c-erfcB-2 has a stronger correlation with 
poor prognostic factors. ».* Studies have tended to show no correlation between 
amplification of c-erf>E-2 and c-erbB-1 or overproduction of c-erfcB-2 and EGFR, 
although at the molecular level EGFR mediates phosphorylation of c-^riB-2 
protein. 51 ' 5 *- 61 Recent reviews describe EGFR in breast carcinoma, 43 - 100 

The genes c-erfeA and ear-1 are homologous to the thyroid hormone recep- 
tor, and they are located adjacent to oerfcB-2 on chromosome 17. These genes 
are frequently coamplified with c-er&B-2 in breast carcinomas. The absence of 
c-erbA expression in breast carcinomas, however, is evidence against an impor- 
tant role for this gene in breast neoplasia. 90 Amplification of c-erf>B-2 can occur 
without ear-1 amplification, and these tumors have a decreased survival that is 
similar to tumors with both c-erfoB-2 and ear-1 amplification. 87 Consequendy, 
c-erhB-2 amplification seems to be more important than amplification of oerbA 
or ear-1. 

Other genes also have been compared with c-erfeB-2 activation in breast 
carcinomas. One study found a significant correlation between increased c-erfcB- 
2 mRNA and increased mRNAs of/as, platelet-derived growth factor chain A, 
and Ki-ras.™ Allelic deletion of c-Ha-r<w may indicate a poorer prognosis in 
breast carcinoma,* 1 but it has not been compared with c-er&B-2 activation. Some 
studies have suggested a correlation between advanced stage or recurrence of 
* breast carcinoma and activation of any one of several oncogenes.*^ 3 
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ACTIVATION OF c-erbB-2 IN NON-MAMMARY TISSUES 

Incidence of c-ert>B-2 Activation in Non-Mammary Tissues 

Table 7 summarizes the normal tissues In which c-er&B-2 expression has been 
detected, usually with iminunohistochetnical methods using polyclonal anti- 



TABLE 7. PRESENCE OR ABSENCE OF c-e/*B-2 mRNA OA *er*B4 PROTEIN IN 
NORMAL HUMAN TISSUES 



Tissues With 

c-erbB-2 Tissues Producing Tissues Lacking Tissues Lacking 

mRNA c-erbB-2 Protein* c-erfcB-2 mRNA c-erbB-2 Protein 



Skin« 


Epidermis 58 




■ 


external root sneatn 65 




ccenne sweat gianu 






reiai orai mucosa 0 * 


rosinaiai orai mucosa^ 






rvvll Ha la) twvpnayUo 


Stomach « 












,—,94 

Jejunum" 


Small Intestine 22 ' 82 


■ 


Colon" 


Colon 22 ' 62 




KWney* 


Fatal kidneys Kidneys'" 


Glomerulus 62 






Postnatal Bowman's capsule 62 




Fetal proximal tubule 82 


Postnatal proximal tubule 62 




Distal tubule 82 






Fetal collecting duct 62 


Postnatal collecting dud 62 




Fetal renal pelvis 82 


Postnatal renal pelvis 62 




Fetal ureter* 


Postnatal fetal ureter 62 


Uver* 


Hepatocytes 22 


Liver 62 - 95 




Pancreatic acini 22 






Pancreatic ducts 2 * 62 






Endocrine cells of Islets 


Pancreatic Islets 62 




ot Langertians 22 






.^FetaVtrachea 62 ^ . w 


Postnatal trachea 62 


Fetal bronchioles 62 


Polfnatal bronchioles 62 ' 




Bronchioles 89 








Postnatal alveoli 82 - 99 


Fetal braird 




Postnatal brain 62 




Fetal ganglion cells 62 


Postnatal ganglion ceils 82 


Thyroid 1 






Uterus 24 








Ovary 12 






Blood vessels 42 


Endothelium 92 


Placenta* 1 










Adrenocortical cells 62 






Postnatal thymus 92 






Fibroblasts 62 






Smooth muscle ceils 62 






Cardiac muscle cells 62 



*Thl$ protein study used Western blots; the rest used Immunohistochomteal methods. 
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bodies. Only a few studies have been performed, and some of these do not 
demonstrate convincing cell membrane reactivity in the published photo- 
graphs. The interpretations in these studies, however, are listed, with die 
caveat that these findings should be confirmed by immunoprecipitation or 
Western or RNA blots. Production of c-erfcB-2 has been identified in normal 
epithelium of the gastrointestinal tract and skin. Discrepancies regarding c- 
erbB-2 protein in other tissues could be due, at least in part, to differences in 
techniques. 

The data on c-erfoB-2 activation in various non- mammary neoplasms 
should be interpreted with caution, because only small numbers of tumors have 
been studied, usually by immunohistochemical methods using polyclonal anti- 
bodies. Studies using cell lines have been excluded, because cell culture can 
induce amplification and overexpression of other genes, although this has not 
been documented for c-€rbB-2_ 

Activation of c-erfeB-2 has been identified in 32 percent (64 of 203) of 
ovarian carcinomas in eight studies (Table 8). One abstract 45 stated that ovarian 
carcinomas contained significantly more c-eriB-2 protein than ovarian non- 
epithelial malignancies. Another report 91 showed that 12 percent of ovarian 
carcinomas had c-er&B-2 overproduction without amplification. 

Activation of c-erbB~2 has been identified in 20 percent (40 of 198) of 

• gastric adenocarcinomas in seven studies, including 33 percent (21 of 64) of 



TABLE 8. c^rbB-2 ACTIVATION IN HUMAN GYNECOLOGIC TUMORS" 







c-ert>B-2 


c-eroB-2 






roRNA 


Protein 




o-erbB-2 DNA 


Over- 


Over* 


Tumor Type 


Amplification 


production 


production 


Ovary— carcinoma, not otherwise 


31/120 « 1/11 w 


23/67*1 


23/73* 


specified 


0/6 WQ/6W0/3,"* 
072*0/1"° 




3677281 


Ovary— serous (papillary) carcinoma 


2/7/10 1/7,1120/6* 






bvary^ndomelrtoid carcinoma 


0/31W 






Ovary— mucinous carcinoma 


Wo 0/1* 






Ovary— clear cell carcinoma 


072, 1 «0/1* 






Ovary— mixed epithelial carcinoma 


0/2* 






Ovary— endometrioid borderline tumor 


0/1* 






Ovary— mucinous borderline tumor 


0/3* 






Ovary— serous cystadenoma 


0/4* 






Ovary— mucinous cystadenoma 


0/2* 






Ovary— sclerosing stromal tumor 


0/1* 






Ovary— fibrothecoma 


0/1* 






Uterus — endometrial adenocarcinoma 


0/4" 0/1 110 







'Shown as number of cases with amplification (or overproduction ytotal number of cases studied; reference is 
fllven as superscript All protein studies used Immunohfetochemlcal methods. 

I 
i 

I 
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intestinal or tubular subtypes and 9 percent (4 of 47) of diffuse or signet ring cell 
subtypes (Table 9). Activation of c-erfcB-2 has been detected in 2 percent (6 of 
281) of colorectal carcinomas, although an additional immunohistochemical 
study detected c-erfcB-2 protein in seven of eight tissues fixed in Bouin's solu- 
tion. One study found greater immunohistochemical reactivity for c-erfci-2 
protein in colonic adenomatous polyps than in the adjacent normal epithelium, 
using Bouins fixative. Lesions with anaplastic features and progression to inva- 
sive carcinoma tended to show decreased immunohistochemical reactivity for c- 
erfeB-2 protein. 22 Hepatocellular carcinomas (12 of 14 cases) and cholangiocarci- 
nomas (46 of 63 cases) reacted with antibodies against c-erfcB-2 in one study, but 
some of these "positive 1 ' cases showed only diffuse cytoplasmic staining, which 

TABLE a oorbB-2 ACTIVATION IN HUMAN OASTRO^ESTINAL TU MORS* 

c-ert>B-2 
Protein 



Tumor Type 


c-ert)B-2 DMA 
Amplification 


Over- 
production 


Esophagus— squamous cell carcinoma 


071 w 


0/1" 


Stomach— cartinoma, poorly differentiated 


0/22 1M 




Stomach— adenocarcinoma 


2/24,«2n?, 107 2/8, 11 ' 


4/27»3/10* 




won** 




Stomach— carcinoma, intestinal or tubular type 


6V10<« 


16/64» 


Stomach— carcinoma, diffuse or signet ring cell type 


0/2i<* 


4/45® 


Colorectum— carcinoma 


2/49 « 1/45,"' 


1/22" 7/B 226 




1/45, w 1/45> 






0/40," 0/32,"" 0/3« 




Colon— villous adenoma 


0/1W 




Colon— tubulovUlous adenoma 


0/5*> 




Colon— tubular fcdenoma 


077*> 


19/1 


Colon— hyperplastic polyp 


0/1« 




Intestine— leiomyosarcoma 




0/1W 


Hepatocellular carcinoma 




12/14 b OP' 


Hepatoblastoma 


0/1 57 




Cholanglocarcinoma 




46/63*5 


Pancreas— adenocarcinoma 




2W0/2 61 


Pancreas— acinar carcinoma 




0/1 41 


Pancreas — clear cell carcinoma 






Pancreas— large cell carcinoma 




0/3* 


Pancreas— signet ring carcinoma 




0/1* 


Pancreas— chronic Inflammation 




0/14 41c 



•Shown as number of cases with amplification (or overproductlon)/lota! number of cases studied; reference is 
given as superscript. All protein studies used Immunohistochemical methods. No studies analyzed for c~e/©B- 
2mRNA. 

Tissues fixed in Bouin's solution. 

c Onty cases with distinct membrane staining are Interpreted as showing oeroB-2 overproduction. 
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TABLE 10. oerpB-2 ACTIVATION IN HUMAN PULMONARY TUMORS* 







c-eroB-2 


Tumor Type 


c-erbB-2 ONA 


Protein 


Amplification 


Overproduction 


Non-small cell carcinoma 




1/84« 


Epidermoid carcinoma 


0/13 « 0/10*7 0/6» 


3/5" 


Adenocarcinoma 


0/21 « 1/13 » 0/7,'" 0/7* 0/3<« 


4/12* 


Large cell carcinoma 


079«0/6«> 




Small cell carcinoma 




0/26M073" 


Carcinoid tumor 


0/1* 


0/3* 



•Shown as number of oases with amplification (or overproductlonytotal number of cases studied; reference Is 
given as superscript All protein studies used Immunohlstoehemlcal methods. No studies analyzed tor c-ertB- 
2 mRNA. • <■-.,. 



does not indicate c-er&B-2 activation in breast neoplasms. 95 Also, some pancre- 
atic carcinomas and chronic pancreatitis tissue had cytoplasmic immunohisto- 
chemical reactivity for c-erbB-2 protein, in addition to the rare case of pancre- 
atic adenocarcinoma with distinct cell membrane staining. 41 

Tables 10 through 14 summarize the studies of c-erbB-2 activation in other 
neoplasms. The c-eriB-2 oncogene is not activated in most of these tumors! 
Activation of c-er6B-2 has been detected in 1 percent (4 of 299) of pulmonary 
non-small cell carcinomas in nine studies, although one additional report 66 
found c-erfcB-2 protein overproduction in 41 percent (7 of 17). Renal cell carci- 
noma had c-er£>B-2 activation in 7 percent (2 of 30) in four studies. Overproduc- 
tion of c-erfcB-2 protein was described in one transitional cell carcinoma of the 
urinary bladder, a grade 2 papillary lesion. 58 Squamous cell carcinoma and basal 
cell carcinoma of the skin may contain c-erhB-2 protein, but it is not clear 



TABLE 11. c-erbB-2 ACTIVATION IN HUMAN HEMATOLOGIC PROLIFERATIONS* 







o-erftB-2 


c-erbB-2 






mRNA 


Protein 




c«*rbB-2 DMA 


Over- 


Over- 


Tumor Type 


Amplification 


production 


production 


Hematologic malignancies 


0/231" 






Malignant lymphoma 


0/9, 67 0/3™ 


Q/V 


0/15*" 


Acute leukemia 


0/1 4s 7 






Acute lymphoblastic leukemia 


0/1 107 






Aoute myeloblasts leukemia 


073™ 






Chronic leukemia 


0/1957 






Chronic lymphocytic leukemia 


0/6™ 






Chronic myelogenous leukemia 


0/6™ 






Myeloproliferative disorder 


0/1 " 







"Shown as numbsr of oases with amplification (or overproductionytotal number ol cases studied; referenoe Is 
given as superscript. All protein studies used knmimohtstochemical methods. 
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TABLE 12. c-wtoB-2 ACTIVATION IN HUMAN TUMORS OF SOFT TISSUE AND BONE* 





C-ertB-2 DMA 


Tumor Type 


Amplification 


Sarcoma 


0/10," 1 0/8" 


Malignant fibrous histiocytoma 


0/1 w 


Liposarcoma 




Pleomorphic sarcoma 


0/1 iw 


Rhabdomyosarcoma 


0/1 tor 


Osteogenic sarcoma 




Chondrosarcoma 


Q/1107 


Ewing's sarcoma 


0/1 » 


Schwannoma 


0/1*7 



'Shown as number of cases with amplification (or overorodudlon)/total number of cases studied; reference rs 
gVen as superscript No studies analyzed for c*rt>B-2 mRNA or c-erfiB-2 protein. 



whether the protein level is increased over that of normal skin. 56 Thyroid 
carcinomas and adenomas can have low levels of increased c-eriB-2 mRNA. 
♦ ° ne abstract , described low-level c-*rfcB-2 DNA amplification in one of ten 

salivary gland pleomorphic adenomas. 48 

Correlation of c-ert>B-2 Activation With Patient Outcome 

Very few studies have attempted to correlate c-erfcB-2 activation in non- 
mammary tumors with outcome. Slamon et al 81 showed that c-erbB-2 amplifica- 
tion or overexpression in ovarian carcinomas correlates with decreased survival, 
especially when marked activation is present. However, they did not report the 
stage, histological grade, or histological subtype of these neoplasms. Another 
study of stages III and IV ovarian carcinomas found a correlation between 
decreased survival and c-erfcB-2 protein overproduction, but not between sur- 
vival and histological grade.** One abstract stated that c-erfcB-2 protein overpro- 
duction in 10 of 16 pulmonary adenocarcinomas correlated with decreased 
disease-free interval. 70 Another abstract described a tendency for immunohisto- 



TABLE 13. c-erfrB-2 ACTIVATION IN HUMAN TUMORS OF THE URINARY TRACT 8 



Tumor Type 


o-erbB-2 DNA 
Amplification 


c-e/bB-2 
mRNA 
Over- 
production 


c-erbB-2 
Protein 
Over- 
production 


Kidney— renal cell carcinoma 


1/6,67 1/4i 107 0/5M 


0/16 1 * 




Wilms' tumor 


0/4* 






Prostate— adenocarcinoma 






0/2358 


Urinary bladder— carcinoma 






1/40» 



•Shown as number of cases with amplification (or overoroducttan)rtotal number of cases studied; reference is 
given as superscript. All protein studies used ImmunohlstochemiceJ methods. . 
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TABLE 14. c-erfrB-2 ACTIVATION IN MISCELLANEOUS HUMAN TUMORS- 



Tumor Type 



c-erbB-2 
DNA 
Amplification 



Skin— malignant melanoma 
Skin, head and neck — squamous 
cell carcinoma 

Site not stated — squamous cell 
carcinoma 

Salivary gland— adenocarcinoma 
Parotid gland— adenoid cystic 
carcinoma 

Thyroid-- anaplastic carcinoma 
Thyroid— papillary carcinoma 
Thyroid— adenocarcinoma 
Thyroid— adenoma 
Neuroblastoma 
Meningioma 



oert>B-2 mRN A 
Overproduction 



c-ertjB*2 
Protein 
Over- 
production 



0/7™ 
0/8" 0/2* 
1/1* 



0/1 < 
0/5' 
0/1" 

0V2 1 1 (low levels)*' 

0/35*0/9,^0/1* — 
0/2 57 _ 



0/1 0» 



0/1* 



^/l^ ~ 
3(low levete)/5' 



•Shown es number of cases wrth amplification (or overproducttenyiotal number of cases studied* referee i« 
given as superscript All protein studies used ImmunohlstochemloaJ methods. 

chemical reactivity for c-erfeB-2 protein to correlate with higher grades of pros- 
tatic adenocarcinoma." Additional prognostic studies of ovarian carcinomas and 
other neoplasms are needed. 



SUMMARY 

Activation of the c-er6B-2 oncogene can occur by amplification of c-er*B-2 
DNA and by overproduction of c-er*B-2 mRNA and*-eriB-2 protein. Approxi- 
mately 20 percent of breast carcinomas ^show evidence of c-erfcB-2 activation 
which correlates with a poor prognosis primarily in patients with metastasis to 
axillary lymph nodes. Studies that have attempted to correlate c-*r&B-2 activa- 
tion with other prognostic factors in breast carcinoma have reported conflicting 
conclusions. The pathologic and clinical significance of c-erfcB-2 activation in 
other neoplasms is unclear and should be assessed by additional studies 
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DECLARATION OF PAUL POLAKIS, Ph.D. 

» 

I, Paul Polakis, Ph.D., declare and say as follows: , 

1. I was awarded a PhD by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. S ince joining Genentech in 1999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins". When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date we 
have generated antibodies that bind to about 30 of the tumor antigen proteins ' 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion mat for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
arid further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both 
under Section 1 00 1 of Title 1 8 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
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glioma xenografts by Apo2L/TRAIL. Biochem. Biophys. Res. Commun. 265, 479- 
483(1999). , 

54. Hymowitz, S.G., Christinger, H.W., Fuh, G., Ultsch, M., O'Connell, M., Kelley, 
R-F., A^hkOTazi^A. and de Vos, A.M. Triggering Cell Death: The Crystal 
Structure of Apo2I/TRAIL in a Complex with Death Receptor 5. Molec. Cell 4, 
563-571 (1999). 

55. Hymowitz, S.G., O'Connel, M.P., Utsch, M.H., Hurst, A., Totpal, K, Ashkenazi. 
jA, de Vos, A.M., Kelley, R.F. A unique zmc-binding site revealed by a high- 
resolution X-ray structure of hombtrimeric Apo2L/TRAIL. Biochemistry 39, 633- 
640(2000). 9 •■»_ ' » " — ■*> » 

56. Zhou, Q., Fukushima, P., DeGraff, W., Mitchell, J.B., Stetler-Stevenson, M., 
Ashkenazi. A., and Steep. P.S. Radiation and the Apo2L/TRAIL apoptotic 
pathway preferentially inhibit the colonization of premalignant human breast 
cancer cells overexpressing cyclin Dl. Cancer Res. 60, 261 1-2615 (2000). 

57. Kischkel, F.C., Lawrence, D. A-, Chuntharapai, A., Schow, P., Kim, J., and 
Ashkenazi. A. Apo2I7TRAIL-dependent recruitment of endogenous FADD and 
Caspase-8 to death receptors 4 and 5. Immunity 12, 61 1-620 (2000). 

58. Yan, M., Marsters, S.A., Grewal, I.S., Wang, H., * Ashkenazi. A., and *Dixit, 
V.M. Identification of a receptor for BlyS demonstrates a crucial role in humoral 
immunity. Nature Immunol. 1, 37-41 (2000), 
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59. Marsters, S.A., Yan, M., Pitti, R.M., Haas, P.E., Dixit, V.M., and Ashkenazi. A. 
Interaction of the TNF homologues BLyS and APRIL with the TNF receptor 
homologues BCMA and TACI. Curr. Biol. 10, 785-788 (2000). 

60. Kischkel, F.C., and Ashkenazi. A . Combining enhanced metabolic labeling with 
immunoblotting to detect interactions of endogenous cellular proteins. 
Bidtechniques 29, 506-512 (2000). 

61. Lawrence, D., Shahrokh, Z., Marsters, S., Achilles, K., Shih, D. Mounho,B., 
Hillan, KL, Totpal, K. DeForge, L., Schow, P., Hooley, J., Sherwood, S., Pai, R., 
Leung, S., Khan, L, Gliniak, B., Bussiere, J., Smith, C, Strom, S., Kelley, S:, 
Fox, J , Thomas, D., and Ashkenazi. A. Differential hepatocyte toxicity of 
recombinant Apo2L/TRAIL versions. Nature Med. 7, 383-385 (2001). 

62. Chuntharapai, A., Dodge, K., Grimmer, K., Schroeder, K., Martsters, S.A., 
Koeppen. H.. Ashkenazi. A ., and Kim. K.J. Isotype-dependent inhibition of 
tumor growth in vivo by monoclonal antibodies to death receptor 4. J. Immunol. 
166,4891-4898(2001). 

63. Pollack, IF.. Erff. M. and Ashkenazi. A . Direct stimulation of apoptotic 
signaling by soluble Apo2L/tumor necrosis factor-related apop tosis-inducing 
ligand leads to selective killing of glioma cells. Clin. Cancer Res.. 7, 1362-1369 
(2001). 

64. .Wang, H., Marsters, S. A., Baker, T., Chan, B., Lee, W.P., Fu, L., Tumas, D., Yan, 
M., Dixit, V.M., ♦ Ashkenazi. A ., and *Grewal, I.S. TACI-ligahd interactions are 
required for.T cell activation and collagen-induced arthritis in mice. Nature 
Immunol. 2, 632-637 (2001). 

65. Kischkel F.C., Lawrence, 5 D^ A,, Tinel, A., Virmani, A.,-Schow, P., Gazdar, A., 
Blenis, J.. Arnott P.. and Ashkenazi. A . Death receptor recruitment of 
endogenous caspase-10 and apoptosis initiation in the absence of caspase-8. J. 
Biol. Chem. 276, 46639-46646 (2001). 

66. LeBlanc, H., Lawrence, D.A., Varfolomeev, E„ totpal, K., Morlan, J., Schow, P., 
Fong, S., Schwall, R., Sinicropi, D., and Ashkenazi. A T umor cell resistance to 
death receptor induced apoptosis through mutational inactivation of the 
proapoptotitc Bcl-2 homolog Box..- Nature Med. 8, 274-281 (2002). 

67. Miller, K., Meng, G., Liu, J., Hurst, A., Hsei, V., Wong, W-L., Ekert, R, 
Lawrence, D., Sherwood, S., DeForge, L., Gaudreault, Keller, G., SUwkowski, 
M., Ashkenazi. A ., and Presta, L. Design, Construction, and analyses of 
multivalent antibodies. J. Immunol. 170, 4854-4861 (2003): 
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68. Varfolomeev, E., Kischkel, R, Martin, F., Wanh, H., Lawrence, D,, Olsson, C, 
Tom, L., Erickson, S., French, D., Schow, P., Grewal, I. and AshkenazL A. 
Immune system development in APRIL knockout mice. Submitted; 

Review articles: 

1. AshkenazL A., Peralta, E., Winslow, j., Ramachandran, J., and Capon, D., J. 
Functional role of muscarinic acetylcholine receptor subtype diversity. Cold 
Spring Harbor Symposium on Quantitative Biology. LIII,\263-272 (1988). . 

2. AshkenazL A .. Peralta, E., Winslow, J., Ramachandran, J., and Capon, D. 
Functional diversity of muscarinic receptor subtypes in cellular signal 
transduction and growth. Trends Pharmacol Scu Dec Supplement, 12-21 (1989). 

3. Chamow, S., Duliege, A., Ammann/A., Kahn, J., Allen, D., Eichberg, J., Byrn, 
R., Capon, D., Ward, R., and AshkenazL A . CD4 immunoadhesins in anti-HIV 
therapy: new developments. Int. J. Cancer Supplement 7, 69-72 (1992). 

4. AshkenazL A ., Capon, and D. Ward, R. Immunoadhesins. Int. Rev. Immunol 10, 
217-225 (1993). 

5. AshkenazL A ., and Peralta, E. Muscarinic Receptors. In Handbook of Receptors 
and Channels. (S. Peroutka, ed.), CRC Press, Boca Raton, Vol. I, p. 1-27, (1994). 

6. Krantz, S. B., Means, R. T., Jr., Lina, J., Marsters, S. A., and AshkenazL A . 
Inhibition of erythroid colony formation in vitro by gamma interferon. In 
Molecular Biology of Hematopoiesis (N. Abraham, R, Shadduck, A. Levine F. 
Takaku, eds.) Intercept Ltd. Paris, Vol. 3, p. 135-147 (1994). 

7. AshkenazL A . Cytokine neutralization as a potential therapeutic approach for 
? SIRS and shock. J. Biotechnology in Healthcare 1, 197-206 (1994). - 

8; AshkenazL A ., and Chamow, S. M. Immunoadhesins: an alternative to human , 
monoclonal antibodies. Immunomethods: A companion to Methods in 
Enzimology 8, 104-1 15 (1995). . 

9. Chamow, S., and AshkenazL A . Immunoadhesins: Principles and Applications. 
Trends Biotech. 14, 52-60 (1996). 

10. AshkenazL A ., and Chamow, S. M. Immunoadhesins as research tools and 
therapeutic agents. Curr. Opin. Immunol 9, 195-200 (1997). — 

1 1 . AshkenazL A ., and Dixit, V. Death receptors: signaling and modulation. Science 
281,1305-1308(1998). 

12. AshkenazL A ., and Dixit, V, Apoptosis control by death and decoy receptors. 
Curr. Opin. Cell Biol 11, 255-260 (1999). 



13. AshkenaziA . Chapters on Apo2L/TRAIL; DR4, DR5, DcRl, DcR2; and DcR3. 
Online Cytokine Handbook (www.apnetcom/cvtokinereferenceA . 

14. Ashkenazu A , Targeting death and decoy receptors of the tumor necrosis factor 
superfamily. Nature Rev, Cancer 2, 420-430 (2002). 

15. LeBlanc, H. and Ashkenazi. A . Apoptosis signaling by Apo2L/TRAIL. Cell Death 
and Differentiation 10, 66-75 (2003). 

16. Almasan, A. and Ashkenazi. A . Apo2L/TRAIL: apoptosis signaling, biology, and 
potential for cancer therapy. Cytokine and Growth Factor Reviews 14, 337-348 
(2003), 

Book: 

Antibody Fusion Proteins (Chamow, S., and AshkenazL A ~ eds,, John Wiley and 
Sons Inc.) (1999). 

Talks: 

1 . Resistance of primary HIV isolates to CD4 is independent of CD4-gp 1 20 binding 
affinity. UCSD Symposium, HTV Disease: Pathogenesis and Therapy. 
Greenelefe, FL, March 1991. 

2. Use of immuno-hybrids to extend the half-life of receptors. IBC conference on 
Biopharmaceutical Halflife Extension. New Orleans, LA, June 1992. 

3. Results with TNF receptor Immunoadhesins for the Treatment of Sepsis. IBC 
conference on Esdotoxemia and Sepsis. Philadelphia, PA/ June-4 992; ^ 

4. Immunoadhesins: an alternative to human antibodies. EBC conference on 
Antibody Engineering. San Diego, CA, December 1 993. 

5. Tumor necrosis factor receptor: a potential therapeutic for human septic shock. 
American Society for Microbiology Meeting, Atlanta, GA, May 1 993 . 

6. Protective efficiacy of TNF receptor immunoadhesin vs anti-TNF monoclonal 
antibody in a rat model for endotoxic shock. 5th International Congress on TNF. 
Asilomar, CA, May 1994. 

7. Interferon-y signals via a multisubunit receptor complex that contains two types of 
polypeptide chain. American Association of Immunologists Conference. San 
Franciso, CA, July 1995. 

8. Immunoadhesins: Principles and Applications. Gordon Research Conference on 
Drug Delivery in Biology and Medicine. Ventura, CA, February 1996. 
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Apo-2 Ligand, a new member of the TNF family that induces apoptosis in tumor 
cells. Cambridge Symposium on TNF arid Related Cytokines in Treatment of 
Cancer. Hilton-Head, NC, March 1996. 

Induction of apoptosis by Apo2 Ligand. American Society for Biochemistry and 
Molecular Biology, Symposium on Growth Factors and Cytokine Receptors. New 
Orleans, LA, June, 1996. 

Apo2 ligand, an extracellular trigger of apoptosis. 2nd Clontech Symposium, 
Palo Alto, C A, October 1996. 

Regulation of apoptosis by members of the TNF ligand and receptor families. 
Stanford University School of Medicine/Palo Alto, CA, December 1996. 
Apo-3: anovel receptor that regulates cell deauVand inflammation. 4th 
international Congress on Immune Consequences of Trauma, Shock, and Sepsis. 
Munich, Germany, March 1997. 

New members of the TNF ligand and receptor families that regulate apoptosis, 
inflammation, and immunity. UCLA School of Medicine, LA, CA, March 1997. 
Immunoadhesins: an alternative to monoclonal antibodies: 5th World Conference 
on Bispecific Antibodies. Volendam, Holland, June 1997. 
Control of Apo2L signaling. Cold Spring Harbor Laboratory Symposium on 
Programmed Cell Death. Cold Spring Harbor, New York. September, 1997. 
Chairman and speaker, Apoptosis Signaling session. IBC's 4th Annual 
Conference on Apoptosis. San Diego, CA., October 1997. 
Control of Apo2L signaling by death and decoy receptors. American Association 
for the Advancement of Science. Philadelphia, PA, February 1998. 
Apo2 ligand and itsjfeceptors. American Society of Immunologists.. San 
Francisco, CA, April 1998. 

Death receptors and ligands. 7th International TNF Congress. Cape Cod, MA, 
May 1998. 

Apo2L as a potential therapeutic for cancer. UCLA School of Medicine. LA, 
CA, June 1998. 

Apo2L as a potential therapeutic for cancer. Gordon Research Conference on 
Cancer Chemotherapy. New London, NH, July 1998. 

Control of apoptosis by Apo2L: Endocrine Society Conference, Stevenson, WA, 
August 1998. 

Control of apoptosis by Apo2L. International Cytokine Society Conference, 
Jerusalem, Israel, October 1998. 
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25. Apoptosis control by death arid decoy receptors. American Association for 
Cancer Research Conference, Whistler, BC, Canada, March 1 999. 

26. Apoptosis control by death and decoy receptors. American Society for ; 
Biochemistry and Molecular Biology Conference, San Franciscd, CA, May 1999. 

27. Apoptosis control by death and decoy receptors. Gordon Research Conference on 
Apoptosis, New London, NH, June 1999. 

28. Apoptosis control by death and decoy receptors. Arthritis Foundation Research 
Conference, Alexandria GA, Aug 1999. 

29. Safety and anti-tumor activity of recombinant soluble Apo2L/TRAEL Cold 
Spring Harbor Laboratory Symposium on Programmed Cell Death. . Cold Spring 
Harbor, NY, September 1999. 

30. The Apo2L/TRAJL system: therapeutic potential. American Association for 
Cancer Research, Lake Tahoe, NV, Feb 2000. 

31. Apoptosis and cancer therapy. Stanford University 'School of Medicine, Stanford, 
CA, Mar 2000. 

32. Apoptosis and cancer therapy. University of Pennsylvania School of Medicine, 
Philadelphia, PA, Apr 2000. 

33. Apoptosis signaling by Apo2L/TRAIL. International Congress on TNF. 
Trondheim, Norway, May 2000. 

34. The Apo2L/TRAIL system: therapeutic potential. Cap-CURE summit meeting. 
Santa Monica, CA, June 2000. 

35. The Apo2L/TRAIL system: therapeutic potential. MD Anderson Cancer Center. 
Houston, TX, June 2000. 

36. r Apc^tosis signaling by^ 

San Diego, CA, August 2000. 

37. Anti-tumor activity of Apo2L/TRAIL. AAPS annual meeting. Indianapolis, IN 
Aug 2000. 

38. Apoptosis signaling and anti-cancer potential of Apo2L/TRAIL. Cancer Research 
Institute, UC San Francisco, CA, September 2000. 

39. Apoptosis signaling by Apo2I/TRAIL. Kenote address, TNF family 
Minisymposium, NIH. Bethesda, MD, September 2000. 

40. Death receptors: signaling and modulation. Keystone symposium on the 
Molecular basis of cancer. Taos, NM, Jan 2001 . 

41 . Preclinical studies of Apo2I/TRAIL in cancer. Symposium on Targeted therapies 
in the treatment of lung cancer. Aspen, CO, Jan 2001. 
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42. Apoptosis signaling by Apo2I7TRAIL. Wiezmann Institute of Science, Rehovot, 
Israel, March 2001. 

43. Apo2L/TRAIL: Apoptosis signaling and potential for cancer therapy. Weizmann 
Institute of Science, Rehovot, Israel, March 2001. 

44. Targeting death receptors in cancer with Apo2L/TRAIL. Cell Death and Disease 
conference, North Falmouth, MA, Jun 2001. 

45. Targeting death receptors in cancer with Apo2L/TRAIL. Biotechnology 
Organization conference, San Diego, CA, Jun 2001. 

46. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Gordon Research 
Conference on Apoptosis, Qxford, UK, July 2001. 

47. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Cleveland Clinic 
Foundation, Cleveland, OH, Oct 2001. 

48. Apoptosis signaling by death receptors: overview. International Society for 
Interferon and Cytokine Research conference, Cleveland, OH, Oct 2001. 

49. Apoptosis signaling by death receptors. American Society of Nephrology 
Conference. San Francisco, C A Oct 2001. 

50. Targeting death receptors in cancer. Apoptosis: commercial opportunities. San 
Diego, CA Apr 2002. 

51. Apo2L/TRAEL signaling and apoptosis resistance mechanisms. Kimmel Cancer 
Research Center, Johns Hopkins University, Baltimore MD. May 2002. 

52. Apoptosis control by Apo2L/TRAIL. (Keynote Address) University of Alabama 
Cancer Center Retreat, Birmingham, Ab. October 2002. 

53 . Apoptosis signaling hy Apo2L/TRAlL. (Session co-chair) TNF international 

*• (jonfererice. €an Diego, CAs October 2002. ~* »•' *• — - 

54. Apoptosis signaling by Ap62L/fRAIL. Swiss Institute for Cancer Research 
(ISREC). Lausanne, Swizerland. Jan 2003. 

55. Apoptosis induction with Apo2L/TRAIL. Conference on New Targets and 
Innovative Strategies in Cancer Treatment. Monte Carlo. February 2003. 

56. Apoptosis signaling by Apo2L/TRAIL. Hermelin Brain Tumor Center 
Symposium on Apoptosis. Detroit, MI. April 2003. 

57. Targeting apoptosis through death receptors. Sixth Annual Conference on 
Targeted Therapies in the Treatment of Breast Cancer. Kona, Hawaii July 2003. 

58. Targeting apoptosis through death receptors. Second International Conference on 
Targeted Cancer Therapy. Washington, DC. Aug 2003. 

Issued Patents: 
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1. Ashkenazi, A., Chamow, S. and Kogan, T. Caibohydrate-directed crosslinking 
reagents, US patent 5,329,028 (Jul 12, 1994). 

2. Ashkenazi, A, Chamow, S. and Kogan, T. CarbohydrateTdirected crosslinking 
reagents. US patent 5,605,791 (Feb 25, 1997). • ' ' 

3. Ashkenazi, A, Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 5,889,155 (Jul 27, 1999). 

4. Ashkenazi, A, APO-2 Ligand. US patent 6,030,945 (Feb 29, 2000). 

5. Ashkenazi, A, Chuntharapai, A., Kim, J., APO-2 ligand antibodies. US patent 6, 
046, 048 (Apr 4, 2000). 

6. Ashkenazi, A, Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 6,124,435 (Sep 26, 2000). 

7. Ashkenazi, A, Chuntharapai, A, Kim, J., Method for making monoclonal and cross- 
reactive antibodies. US patent 6,252,050 (Jun 26, 2001). 

8. Ashkenazi, A APO-2 Receptor. US patent 6,342,3^9 (Jan 29, 2002). 

9. Ashkenazi, A. Fong, S., Goddard, A, Gurney, A, Napier, M., Tumas, D., Wood, W. 
A-33 polypeptides. US patent 6,410,708 (Jun 25, 2002). 

10. Ashkenazi, A. APO-3 Receptor. US patent 6,462,176 Bl (Oct 8, 2002). 

11. Ashkenazi, A, APO-2LI and APO-3 polypeptide antibodies. US patent 6,469,144 Bl 
(Oct 22, 2002). 

12. Ashkenazi, A, Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents, US patent 6,582,928B1 (Jun 24, 2003). 
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Tue Feb 12 09:55:34 2002 [BLASTP 2.2.1 [Jul-12-2001] , NCBI] 
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Sequences producing High-scoring Segment Pairs: 
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-val 


1 


P_AAB875 85 


Human PR01573 - Homo sapiens. 


1172 


225 


100 


e 


-127 


2 


P_AAG89285 


Human secreted protein, SEQ ID NO: 405 - 


1172 


225 


100 


e 


-127 


3 


P_AAM79556 


Human protein SEQ ID NO 32 02 - Homo sapie 


1172 


225 


100 


e 


-127 


4 


P__AAM78572 


Human protein SEQ ID NO 1234 - Homo sapie 


1172 


225 


100 


e 


-127 


5 


P_AAU292 02 


Human PRO polypeptide sequence #179 - Horn 


1172 


225 


100 


e 


-127 


6 


P_AAY99433 


Human PR01573 (UNQ779) amino acid sequenc 


1172 


225 


100 


e 


-127 


7 


BAA95567.114 


CLDN8 - Homo sapiens 


1172 


225 


100 


e 


-127 


8 


AAH20866 .114 


claudin 8 - Homo sapiens 


1172 


225 


100 


e 


-127 


9 


NPJ)36264.1 


claudin 8 -. Homo sapiens 


1172 


225 


100 


e 


-127 


10 


CLD8_HUMAN 


Claudin-8 /pid=CAB60615 . 1 - homo sapiens 


1172 


225 


100 


e 


-127 


11 


P_AAG89273 


Human secreted protein, SEQ ID NO: 393 - 


1162 


223 


99 


e 


-126 


12 


P_AAU00678 


Human INTERCEPT 309 polypeptide - Homo sa 


1123 


215 


100 


e 


-122 


13 


P_AAB93923 


Human protein sequence SEQ ID NO: 13910 - 


1111 


213 


99 


e 


-120 



Dayhoff Protein Database (Rel 78, Mar 2004) 

P_AAB87585 Human PR01573 - Homo sapiens . 
Length: 225 aa 
Accession: P__AAB87585; 
Species: Homo sapiens. 

Keywords: Human; PRO protein; mapping; patent; GENE SEQ patentdb . 

Patent number: WO200116318-A2 . 

Publication date: 08-MAR-2001. 

Filing date: 24-AUG-2000; 2000WO-US023328 . 

Priority: 01-SEP-1999; 99WO-US020111 . 15-SEP-1999; 99WO-US021090 . 

07-DEC-1999; 99US-0169495P . 09-DEC-1999; 99US- 0170262P . 

ll-JAN-2000; 2000US-0175481P . 18-FEB-2000; 2000WO-US004341 . 

18-FEB-2000; 2 000WO-US004342 . 22-FEB-2000; 2000WO-US004414 . 

01-MAR-2000; 2000WO-US005601 . 03-MAR-2000; 2000US-0187202P . 

21-MAR-2000; 2 000US- 0191007P . 30-MAR-2000; 2000WO-US00843 9 . 

25-APR-2000; 2000US-0199397P . 22-MAY-2000; 2000WO-US014042 . 

05-JUN-2000; 2000US-0209832P . 
Assignee: (GETH ) GENENTECH INC. 

Inventors: Eaton DL, Filvaroff E, Gerritsen ME, Goddard A, Godowski PJ; 

Grimaldi CJ, Gurney AL, Watanabe CK, Wood WI; 
Cross reference: WPI; 2001-183260/18. N-PSDB; AAF92117. 
Title: Eighty four nucleic acids encoding PRO polypeptides, useful in 

molecular biology, including use as hybridization probes, and in 

chromosome and gene mapping. 
Patent format: Claim 12; Fig 120; 278pp; English. 

Comment: The present sequence is a human PRO polypeptide (secreted and 

transmembrane) . The PRO protein, and PRO agonists, PRO antagonists 
or anti-PRO antibodies are useful for preparation of a medicament 
useful in the treatment of a condition which is responsive to the 
PRO protein, agonists, antagonists or anti-PRO antibodies. The PRO 
protein may also be employed as molecular weight markers for 
protein electrophoresis. The PRO coding sequence has applications 
in molecular biology, including use as hybridisation probes, and in 
chromosome and gene mapping 

Database: GENESEQ patent database (v200420, 23-SEP-2004) . 



P_AAG89285 Human secreted protein, SEQ ID NO: 405 - Homo sapiens. 
Length: 225 aa 
Accession: P_AA<389285; ■ 
Species: Homo sapiens. 

Keywords: Human; secreted protein; gene therapy; vaccine; treatment; 

diagnosis; GENSET ; patent; GENESEQ patentdb. 
Patent number: WO200142451-A2 . 
Publication date: 14-JUN-2001. 
Filing date: 07-DEC-2000; 2000WO-IB001938 . 

Priority: 08-DEC-1999; 99US-0169629P . 06-MAR-2000; 2000US-0187470P . 
Assignee: (GEST ) GENSET. 

Inventors: Dumas Milne Edwards J, Bougueleret L, Jobert S; 
Cross reference: WPI; 2001-367870/38. N-PSDB; AAH64888. 

Title: Full length GENSET human nucleic acids encoding potentially secreted 
proteins, useful in gene therapy and vaccination against a variety 
of diseases, and for diagnosis of those diseases. 

Patent format: Claim 2.1; Page 885-886; 921pp; English. 

Comment: The invention relates to full length GENSET human nucleic acids 
encoding potentially secreted proteins. The nucleic acids and the 
polypeptides they encode may be used in the prevention, treatment 
and diagnosis of diseases associated with inappropriate GENSET gene 
expression. For example, they be used to treat disorders associated 
with decreased GENSET gene expression by rectifying mutations or 
deletions in a patient's genome that affect the activity of GENSET 
or by supplementing the patients own production of GENSET 
polypeptides. Conversely, antisense nucleic acid molecules may be 
administered to down regulate GENSET expression by binding with the 
cells' own genes and preventing their expression. The sense and 
antisense nucleic acids may also be used as DNA probes in 
diagnostic assays to detect and quantitate the presence of similar 
nucleic acid sequences in samples, and hence to determine which 
patients may be in need of restorative therapy. The GENSET 
polypeptides may be used as antigens in the production of 
antibodies and in assays to identify modulators (agonists and 
antagonists) of GENSET polypeptide expression and activity. The 
present sequence is a GENSET polypeptide of the invention 

Database:. GENESEQ patent database (v200420, 23-SEP-2004) . 

P_AAM79556 Human protein SEQ ID NO 3202 - Homo sapiens. 
Length: 231 aa 
Accession: P_AAM79556; 
Species: Homo sapiens. 

Keywords: Human; cytokine; cell proliferation; cell differentiation; gene 

therapy; vaccine; peptide therapy; stem cell growth factor; 

haematopoiesis; tissue growth factor; immunomodulatory; cancer; 

leukaemia; nervous system disorder; arthritis; inflammation; 

patent; GENESEQ patentdb. 
Patent number: WO200157190-A2 . 
Publication date: 09-AUG-2001. 
Filing date: 05-FEB-2001; 2001WO-US004098 . 

Priority: 03-FEB-2 000; 2000US- 00496914 . 27-APR-2 000; 2000US- 00560875 . 

20-JUN-2 000; 2000US- 005 98075 . 19-JUL-2000 ; 2 0OOUS- 00620325 . 

01-SEP-2000; 2000US- 00654936 . 15-SEP-2000; 2000US- 00663561 . 

20-OCT-2000; 2000US- 00693325 . 30-NOV-2000; 2000US- 00728422 . 
Assignee: (HYSE-) HYSEQ INC. 

Inventors: Tang YT, Liu C, Drmanac RT, Asundi V, Zhou P, Xu C, Cao Y; Ma Y, 
Zhao QA, Wang D, Wang J, Zhang J, Ren F, Chen R, Wang ZW; Xue AJ, 



Yang Y, Wejhrman T, Goodrich R; 
Cross reference: WPI; 2001-476283/51. N-PSDB; AAK52689. 

Title: Nucleic acids encoding polypeptides with cytokine-like activities, 

useful in diagnosis and gene therapy. 
Patent format: Claim 20; Page 283; 6221pp; English. 

Comment: The invention relates to polynucleotides (AAK51456-AAK53435) and 
the encoded polypeptides (AAM78323-AAM80302) that exhibit activity 
elating to cytokine, cell proliferation or cell differentiation or 
which may induce production of other cytokines in other cell 
populations. The polynucleotides ajid polypeptides are useful in 
gene therapy, vaccines or. peptide therapy. The polypeptides have 
various cytokine-like activities, e.g. stem cell growth factor 
activity, haematopoiesis regulating activity, tissue growth factor 
activity, immunomodulatory activity and activin/inhibin activity 
and may be useful in the diagnosis and/or treatment of cancer, 
leukaemia, nervous system disorders, arthritis and inflammation. 
Note: Records for SEQ ID NO 2110 (AAK52581) , 2111 (AAK52582) and 
3666 (AAM80020) are omitted as the relevant pages from the sequence 
listing were missing at the time of publication 

Database: GENE SEQ patent database (v200420, 23-SEP-2004) . 

P_AAM78572 Human protein SEQ ID NO 1234 - Homo sapiens. 
Length: 225 aa 
Accession: P_AAM78572 ; 
Species: Homo sapiens. 

Keywords: Human; cytokine; cell proliferation; cell differentiation; gene 

therapy; vaccine; peptide therapy; stem cell growth factor ; 

haematopoiesis; tissue growth factor; immunomodulatory; cancer; 

leukaemia; nervous system disorder; arthritis; inflammation; 

patent; GENE SEQ patentdb. 
Patent number: WO200157190-A2 . 
Publication date: 09-AUG-2001. 
Filing date: 05-FEB-2001; 2001WO-US004098 . 

Priority: 03-FEB-2000; 2000US-00496914 . 27-APR-2000; 2000US-00560875 . 

20-JUN-2000; 2000US- 00598075 . 19-JUL-2000; 2000US- 00620325 . 

01-SEP-2000; 2000US- 00654936 . 15-SEP-2000; 2000US-00663561 . 

20-OCT-2000; 2000US- 00693325 . 30-NOV-2000; 2000US- 00728422 . 
Assignee: (HYSE-) HYSEQ INC. 

Inventors: Tang YT, Liu C, Drmanac RT, Asundi V, Zhou P, Xu C, Cao Y; Ma Y 
Zhao QA, Wang D, Wang J, Zhang J., Ren F, Chen R, Wang ZW; Xue AJ, 
Yang Y, Wejhrman T, Goodrich R; 

Cross reference: WPI; 2001-476283/51. N-PSDB; AAK51705. 

Title: Nucleic acids encoding polypeptides with cytokine-like activities, 

useful in diagnosis and gene therapy. 
Patent format: Claim 20; Page 3493-3494; 6221pp; English. 

Comment: The invention relates to polynucleotides (AAK51456-AAK53435) and 
the encoded polypeptides (AAM78323-AAM80302) that exhibit activity 
elating to cytokine, cell proliferation or cell differentiation or 
which may induce production of other cytokines in other cell 
populations. The polynucleotides and polypeptides are useful in 
gene therapy, vaccines or peptide therapy. The polypeptides have 
various cytokine-like activities, e.g. stem cell growth factor 
activity, haematopoiesis regulating activity, tissue growth factor 
activity, immunomodulatory activity and activin/inhibin activity 
and may be useful in the diagnosis and/or treatment of cancer, 
leukaemia, nervous system disorders, arthritis and inflammation. 
Note: Records for SEQ ID NO 2110 (AAK52581) , 2111 (AAK52582) and 



3666 (AAM80020) are omitted as the relevant pages from the sequence 
listing were missing at the time of publication 
Database: GENESEQ patent database (v200420, 23-SEP-2004) . 

P_AAU29202 Human PRO polypeptide sequence #179 - Homo sapiens. 
Length: 225 aa 
Accession: P_AAU29202; 
Species: Homo sapiens. 

Keywords: PRO polypeptide; mammal; tumour; cancer; human; cattle; horse; 

sheep; dog; cat; pig; goat; rabbit; tumour necrosis factor alpha; 

TNF-alpha; blood; chondrocyte cell; cell proliferation; cell 

differentiation; colon; adrenal; lung; breast; prostate; rectum; 

cervix; liver; genetic disorder; patent; GENESEQ patentdb. 
Patent number: WO200168848-A2 . 
Publication date: 20-SEP-2001. 
Filing date: 28-FEB-2001; 2001WO-US006520 . 

Priority: 01-MAR-2000; 2000WO-US005601 . 02-MAR-2000; 2000WO-US005841 . 

03-MAR-2000; 2000US-0187202P . 01-DEC-2000 ; 2000WO-US032678 . 

20-DEC-2000; 2000WO-US034956 . plus 36 more dates. 
Assignee: (GETH ) GENENTECH INC. 

Inventors: Baker KP, Chen J, Desnoyers L, Goddard A, Godowski PJ, Gurney 

AL; Pan J, Smith V, Watanabe CK, Wood WI, Zhang Z; 
Cross reference: WPI; 2001-602746/68. N-PSDB; AAS46103. 

Title: Novel nucleic acids encoding PRO polypeptides, used to diagnose the 
presence of tumors, such as prostate and breast tumors, in mammals 
and to screen for modulators of the compounds. 

Patent format: Claim 11; Fig 358; 774pp; English. 

Comment: Sequences AAU29024-AAU2 9328 represent PRO polypeptides of the 

invention. The PRO polypeptides and their associated nucleic acids 
can be used to detect the presence of a tumour in a mammal by 
comparing the level of expression of a PRO polypeptide in a test 
sample of cells from the animal and a control sample of normal 
cells, whereby a higher level of expression in the test sample 
indicates the presence of a tumour in the mammal. Mammals include 
dogs, cats, cattle, horses, sheep, pigs, goats and rabbits but are 
preferably human. The polypeptides can be used to stimulate tumour 
necrosis factor (TNF) alpha release from human blood, when 
contacted with it. A specific polypeptide can be used to stimulate 
the proliferation or differentiation of chondrocyte cells. The PRO 
proteins can be used to determine the presence of tumours and also 
susceptibility to tumour development, particularly adrenal, lung, 
colon, breast, prostate, rectal, cervical, or liver tumours, in 
mammalian subjects. The oligonucleotide probes specific for the PRO 
nucleic acids can be used for genetic analysis of individuals with 
genetic disorders 

Database: GENESEQ patent database (v200420, 23-SEP-2004) . 

P_AAY99433 Human PR01573 (UNQ779) amino acid sequence SEQ ID NO: 328 - Homo 

sapiens . 
Length: 225 aa 
Accession: P_AAY99433 ; 
Species: Homo sapiens. 

Keywords: Human; PRO polypeptide; membrane bound protein; receptor; 

diagnosis; transmembrane; secretion; immunoadhesion; 

pharmaceutical; screening; patent; GENESEQ patentdb. 
Patent number: WO200012708-A2 . 
Publication date: 09-MAR-2000. 



Filing date: 01-SEP-1999; 99WO-US020111 . 

Priority: 01-SEP-1998; 98US-0098716P . 01-SEP-1998; 98US-0098749P . 

01-SEP-1998; 98US- 0098750P . 18 -NOV- 1998 ; 98US- 0108858P . 18-NOV-1998; 

98US-0108904P. plus 119 more dates. 
Assignee: (GETH ) GENENTECH INC. 

Inventors: Baker K, Goddard A, Gurney AL, Smith V, Watanabe CK, Wood WI; 

Cross reference: WPI; 2000-237871/20. N-PSDB; AAA37115. 

Title: New mammalian DNA sequences encoding transmembrane, receptor or 
secreted PRO polypeptides, useful for screening of potential 
peptide or small molecule inhibitors of the relevant 
receptor/ligand interactions. 

Patent format: Claim 12; Fig 188; 773pp; English. 

Comment: AAA37022 to AAA37144 encode the new isolated human transmembrane, 
receptor or secreted PRO polypeptides given in AAY99340 to 
AAY99462. The transmembrane and receptor PRO proteins can be used 
for screening of potential peptide or small molecule inhibitors of 
the relevant receptor/ligand interactions. The polypeptides and 
nucleotide sequences encoding then have various industrial 
applications, including uses as pharmaceutical and diagnostic 
agents. AAA37145 to AAA37330 represent PCR primers and 
hybridisation probes used in the isolation of the PRO polypeptides 
from the present invention 

Database: GENESEQ patent database (v200420, 23-SEP-2004) . 

BAA95567 CLDN8 /pid=BAA95567 . 1 - Homo sapiens 
Length: 225 aa 

Species: Homo sapiens (human) 

Hattori,M. , Fujiyama, A. , Taylor, T.D. , Watanabe, H. , Yada,T. , Park,H.S. , 
Toyoda,A., Ishii,K., Totoki,Y., Choi,D.K. , Soeda,E., Ohki,M., 
Takagi,T., Sakaki,Y., Taudien, S . , Blechschmidt , K. , Polley,A., 
Menzel,U., Delabar,J., Kumpf,K., Lehmann,R., Patterson, D . , 
Reichwald,K. , Rump, A., Schillhabel ,M. , Schudy,A. , Zimmermann, W. , 
Rosenthal , A. , Kudoh,J., Shibuya,K., Kawasaki , K. , Asakawa,S., 
Shintani , A . , Sasaki , T . , Nagamine , K . , Mi t suyama , S . , 
Antonarakis, S.E. , Minoshima,S. , Shimizu,N. , Nordsiek,G. , 
Hornischer,K. , Barandt,P., Scharfe,M. , Schoen,0., Desario,A., 
Reichelt,J., Kauer,G., Bloecker,H., Ramser, J. , Beck, A. , Klages,S., 
Hennig,S., Riesselmann, L . , Dagand,E., Wehrmaeyer , S . , Borzym,K. , 
Gardiner, K., Nizetic,D., Francis, F., Lehrach,H., Reinhardt,R. and 
Yaspo,M.L., Submitted (10-APR-2000) The Chromosome 21 Mapping and 
Sequencing Consortium: * RIKEN Genomic Sciences Center, Human 
Genome Research Group * Institute of Molecular Biotechnology, 
Genome Analysis * Keio University School of Medicine, Dept. of 
Molecular Biology * GBF, Dept. of Genome Analysis * Max-Planck 
Institute for Molecular Genetics (addresses see below) Title: 
Direct Submission 

Locus: AP001707 

Accession: AP001707 AL163252 BA000005 

Cross-references: GI: 7768788; BAA95567.1; AP001707_2 
Database: GBTRANS 

AAH20866 CLDN8 protein /pid=AAH20866 . 1 - Homo sapiens 
Length: 225 aa 

Species: Homo sapiens (human) 

Strausberg,R. , Submitted (03- JAN-2002) National Institutes of Health, 

Mammalian Gene Collection (MGC) , Cancer Genomics Office, National 
Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 



20892-2590, USA Title: Direct Submission 
Gene: CLDN8 
Locus: BC020866 
Accession: BC020866 

Cross-references: LocusID : 9073 ; AAH20866.1; BC020866_1 
Database: GBTRANS 

CLD8_HUMAN Claudin-8 /pid=CAB60615 . 1 - homo sapiens 
Length: 225 aa 

Species: Homo sapiens (Human) . 

Accession: P56748; EMBL; AJ250711; CAB60615.1. EMBL; AY358707; AAQ89070.1 
EMBL; AP001707; BAA95567.1. EMBL; BC020866; AAH20866.1. EMBL; 
BC058004; AAH58004.1. Genew; HGNC:2050; CLDN8 . InterPro; IPR006187; 
Claudin. InterPro; IPR006188; Claudin_reg. InterPro; IPR004031; 
PMP22_Claudin. Pfam; PF00822; PMP22_Claudin; 1. PRINTS; PR01077; 
CLAUDIN. PROSITE; PS01346; CLAUDIN; 1. 

Keen T.J., Inglehearn C.F., Submitted (nov-1999) to the Embl/genbank/ddbj 
databases, (ref. 1: sequence from n.a.) 

Clark H.F., Gurney A.L., Abaya E . , Baker K. , Baldwin p., Brush J., Chen J 
Chow B., Chui C, Crowley C. , Currell B., Deuel B., Dowd P., Eaton 
D., Foster J., Grimaldi C, Gu Q., Hass P.E., Heldens S., Huang A., 
Kim H.S., Klimowski L., Jin Y. , Johnson S., Lee J., Lewis L., Liao 
D. , Mark M. , Robbie E., Sanchez C. , Schoenfeld J., Seshagiri S., 
Simmons L. , Singh J., Smith V., Stinson J., Vagts A., Vandlen R. , 
Watanabe C, Wieand D., Woods K. , Xie M. -H . , Yansura D., Yi S., Yu 
G., Yuan J., Zhang M . , Zhang Z., Goddard A., Wood W.I,, Godowski 
P., Gray A ., Genome Res. 13, 2265-2270, 2003. MEDLINE=22887296 ; 
PubMed=12975309; DOI=10 . 1101/gr . 1293003 ; (ref. 2: sequence from 
n.a. ) Title: "The secreted protein discovery initiative (SPDI) , a 
large-scale effort to identify novel human secreted and 
transmembrane proteins: a bioinf ormatics assessment."; 

Hat tori M., Fujiyama A., Taylor T.D., Watanabe H w Yada T. , Park H.-S., 
Toyoda A., Ishii K., Totoki Y., Choi D.-K., Soeda E., Ohki M., 
Takagi T., Sakaki Y. , Taudien S., Blechschmidt K. , Pol ley A. , 
Menzel U. , Delabar J., Kumpf K. , Lehmann R., Patterson D., 
Reichwald K. , Rump A., Schillhabel M., Schudy A., Zimmermann W., 
Rosenthal A., Kudoh J., Shibuya K. , Kawasaki K. , Asakawa S., 
Shintani A., Sasaki T.., Nagamine K. , Mitsuyama S., Antonarakis 
S.E., Minoshima S., Shimizu N. , Nordsiek G. f Hornischer K. , Brandt 
P • / Scharfe M. , Schoen O., Desario A., Reichelt J., Kauer G. , 
Bloecker H., Ramser J., Beck A., Klages S., Hennig S., Riesselmann 
L. , Dagand E. # Wehrmeyer S., Borzym K., Gardiner K. , Nizetic D., 
Francis F., Lehrach H., Reinhardt R. , Yaspo M.-L., Nature 405, 
311-319, 2000. MEDLINE=20289799; PubMed=10830953 ; (ref. 3: sequence 
from n.a. ) Title: "The DNA sequence of human chromosome 21."; 

Strausberg R.L. , Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D., 

Collins F.S., Wagner L., Shenmen CM., Schuler G.D., Altschul S.F., 
Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., 
Jordan H., Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., 
Marusina K. , Farmer A. A., Rubin G.M., Hong L., Stapleton M., Soares 
M.B., Bonaldo M.F., Casavant T.L. , Scheetz T.E., Brownstein M.J., 
Usdin T.B., Toshiyuki S., Carninci P., Prange C, Raha S.S., 
Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., Bosak 
S.A., Mcewan P.J., Mckernan K.J., Malek J. A., Gunaratne P.H., 
Richards S., Worley K.C., HaleS., Garcia A.M., Gay L.J., Hulyk 
S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 
Fahey J., Helton E . , Ketteman M., Madan A., Rodrigues S., Sanchez 



A., Whiting M., Madan A., Young A.C., Shevchenko Y . , Bouffard G.G., 
Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez 
A.C., Grimwood J., Schmutz J. , Myers R.M., Butterfield Y.S.N. , 
Krzywinski M.I., Skalska U. , Smailus D.E., Schnerch A., Schein 
J.E., Jones S.J.M., Marra M.A., Proc . Natl. Acad. Sci . U.S.A. 99, 
16899-16903, 2002. MEDLINE=22388257 ; PubMed=12477932 ; 
DOI=10.1073/pnas. 242603899; (ref. 4: sequence from n. a.) Title: 
"Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences . 11 

Keywords: tight junction; transmembrane. 

Taxid: tx:9606 

Gene name: Name=CLDN8 ; 

8-28/Domain: Transmem Potential. 

82-102/Domain: Transmem Potential. 

118-138/Domain: Transmem Potential. 

167-187/Domain: Transmem Potential. 

Database: Swissprot (SPROT) , Release 40 (Jan 11, 2003) 

P_AAG89273 Human secreted protein, SEQ ID NO: 393 - Homo sapiens. 
Length: 225 aa 
Accession: P_AAG892 73; 
Species: Homo sapiens. 

Keywords: Human; secreted protein; gene therapy; vaccine; treatment; 

diagnosis; GENSET; patent; GENE SEQ patentdb. 
Patent number: W0200142451-A2 . 
Publication date: 14-JUN-2001. 
Filing date: 07-DEC-2000; 2000WO-IB001938 . 

Priority: 08-DEC-1999; 99US-0169629P . 06-MAR-2000; 2000US- 0187470P . 
Assignee: (GEST ) GENSET. 

Inventors: Dumas Milne Edwards J, Bougueleret L, Jobert S; 
Cross reference: WPI; 2001-367870/38. N-PSDB; AAH64876. 

Title: Full length GENSET human nucleic acids encoding potentially secreted 
proteins, useful in gene therapy and vaccination against a variety 
of diseases, and for diagnosis of those diseases. 

Patent format: Claim 21; Page 879; 921pp ; English. 

Comment: The invention relates to full length GENSET human nucleic acids 
encoding potentially secreted proteins. The nucleic acids and the 
polypeptides they encode may be used in the prevention, treatment 
and diagnosis of diseases associated with inappropriate GENSET gene 
expression. For example, they be used to treat disorders associated 
with decreased GENSET gene expression by rectifying mutations or 
deletions in a patient's genome that affect the activity of GENSET 
or by supplementing the patients own production of GENSET 
polypeptides. Conversely, antisense nucleic acid molecules may be 
administered to down regulate GENSET expression by binding with the 
cells 1 own genes and preventing their expression. The sense and 
antisense nucleic acids may also be used as DNA probes in 
diagnostic assays to detect and quantitate the presence of similar 
nucleic acid sequences in samples, and hence to determine which 
patients may be in need of restorative therapy. The GENSET 
polypeptides may be used as antigens in the production of 
antibodies and in assays to identify modulators (agonists and 
antagonists) of GENSET polypeptide expression and activity. The 
present sequence is a GENSET polypeptide of the invention 

Database: GENE SEQ patent database (v200420, 23-SEP-2004) . 



P_AAU00678 Human INTERCEPT 309 polypeptide - Homo sapiens. 



Length: 215 aa 
Accession: P_AAU00678; 
Species: Homo sapiens. 

Keywords: Human; INTERCEPT 309; T cell; heart; liver; pancreas; placenta; 
brain; skeletal muscle; kidney; spleen; lymph node; peripheral 
blood leukocyte; bone marrow; thymus tissue; abnormal blood 
coagulation; asthma; cancer; anaphylaxis; hepatitis; multiple 
sclerosis; coronary artery disease; malaria; atopic dermatitis; 
amyotrophic lateral sclerosis; meningitis; attention deficit 
disorder; Crohn's disease; gastroenteritis; goitre; hypoglycaemia; 
diabetes mellitus; endometriosis; pulmonary embolism; muscular 
dystrophy; immuno- competence; vertebrate; blood; serum; lung; 
patent; GENESEQ patentdb. 

Patent number: WO200129088-A1 . 

Publication date: 26-APR-2001. 

Filing date: 23- JUN-2000 ; 2000WO-US017386 . 

Priority: 19-OCT-1999; 99US-00420707 . 

Assignee: (MILL- ) MILLENNIUM PHARM INC. 

Inventors: Mackay CR, Myers PS, Kirst SJ, Fraser CC, Leiby KR; 
Cross reference: WPI; 2001-308477/32. N-PSDB; AAS00668. 

Title: New isolated nucleic acid molecule for diagnosis, prevention, and 
therapy of human and other animal disorder, or as modulating agent 
for regulating cellular processes. 

Patent format: Claim 8; Fig 3A-3B; 263pp; English. 

Comment: The sequence represents human INTERCEPT 309 polypeptide. This 
protein and similar others exhibit the ability to affect growth, 
proliferation, survival, differentiation, activity, morphology, or 
movement /migration of, e.g. T cells and cells of the heart, liver, 
pancreas, placenta, brain, lung, skeletal muscle, kidney, spleen, 
lymph node, peripheral blood leukocyte, bone marrow or thymus 
tissue. They can be used as modulating agents for regulating 
cellular processes, thus, the proteins and their associated nucleic 
acids can be used to prognosticate, prevent, diagnose, or treat 
disorders associated with physiological processes. These disorders 
include abnormal blood coagulation, asthma, anaphylaxis, hepatitis, 
multiple sclerosis, cancer, coronary artery disease, malaria, 
atopic dermatitis, amyotrophic, lateral sclerosis, meningitis, 
attention deficit disorder, Crohn's disease, gastroenteritis, 
goitre, hypoglycaemia, diabetes mellitus, endometriosis, pulmonary 
embolism and muscular dystrophy. Antibodies to disorders such as 
these can be made by providing a polypeptide of the invention to an 
immuno-competent vertebrate and harvesting blood or serum from the 
vertebrate 

1-2 4 /Peptide 

/note= Signal peptide/ 

25-215/Protein 

/note= Mature human INTERCEPT 309/ 
25-71/Domain 

/note= Extracellular domain/ 
72 -92 /Domain 

/note= Transmembrane domain/ 
93- 107 /Domain 

/note= Cytoplasmic domain/ 
108-131/Domain 

/note= Transmembrane domain/ 
132 -15 3 /Domain 

/note= Extracellular domain/ 



154-178/Domain 

/note= Transmembrane domain/ 

179-215/Domain 

/note= Cytoplasmic domain/ 

Database: GENESEQ patent database (v200420, 23-SEP-2004) . 

P_AAB93923 Human protein sequence SEQ ID NO: 13910 - Homo sapiens. 
Length: 233 aa 
Accession: P_AAB93 923; 
Species: Homo sapiens. 

Keywords: Human; primer; detection; diagnosis; antisense therapy; gene 

therapy; patent; GENESEQ patentdb. 
Patent number: EP1074617-A2 . 
Publication date: 07-FEB-2001. 
Filing date: 28- JUL-2000 ; 2000EP- 00116126 . 

Priority: 29- JUL-1999 ; 99 JP-00248036 . 27-AUG-1999; 99JP-00300253 . 

ll-JAN-2000; 2000JP-00118776 . 02-MAY-2000; 2000 JP- 00183767 . 

09-JUN-2000; 2000JP-00241899 . 
Assignee: (HELI-) HELIX RES INST. 

Inventors: Ota T, Isogai T, Nishikawa T, Hayashi K, Saito K, Yamamoto J; 

Ishii S, Sugiyama T, Wakamatsu A, Nagai K, Otsuki T; 
Cross reference: WPI; 2001-318749/34. 

Title: Primer sets for synthesizing polynucleotides, particularly the 5602 
full- length cDNAs defined in the specification, and for the 
detection and/or diagnosis of the abnormality of the proteins 
encoded by the full-length cDNAs . 

Patent format: Claim 8; SEQ ID NO 13910; 2537pp + Sequence Listing; 
English. 

Comment: The present invention describes primer sets for synthesising 5602 
full- length cDNAs defined in the specification. Where a primer set 
comprises: (a) an oligo-dT primer and an oligonucleotide 
complementary to the complementary strand of a polynucleotide which 
comprises one of the 5602 nucleotide sequences defined in the 
specification, where the oligonucleotide comprises at least 15 
nucleotides; or (b) a combination of an oligonucleotide comprising 
a sequence complementary to the complementary strand of a 
polynucleotide which comprises a 5 1 -end sequence and an 
oligonucleotide comprising a sequence complementary to a 
polynucleotide which comprises a 3 ' -end sequence, where the 
oligonucleotide comprises at least 15 nucleotides and the 
combination of the 5 1 -end sequence/3 1 -end sequence is selected from 
those defined in the specification. The primer sets can be used in 
antisense therapy and in gene therapy. The primers are useful for 
synthesising polynucleotides, particularly full-length cDNAs. The 
primers are also useful for the detection and/or diagnosis of the 
abnormality of the proteins encoded by the full-length cDNAs . The 
primers allow obtaining of the full-length cDNAs easily without any 
specialised methods. AAH03166 to AAH13628 and AAH13633 to AAH18742 
represent human cDNA sequences; AAB92446 to AAB95893 represent 
human amino acid sequences; and AAH13629 to AAH13632 represent 
oligonucleotides, all of which are used in the exemplification of 
the present invention 

Database: GENESEQ patent database (v2 0042 0, 23 -SEP- 2 004) . 



### 

Tue Feb 5 15:17:50 2002 [BLASTN 2.2.1 [Jul-12-2001] , NCBI] 

/home/ruby/va/Molbio/carpenda/tempids/ss .DNA73735 (2010 bp) 
/home/ruby/va/Molbio/carpenda/tempids/ss .DNA73735 



Sequences producing High-scoring Segment Pairs: Frame 


Score 


Match 


Pet 


E-val 


1 


P_AAF92117 


Human PR01573 cDNA. 


+ 


2010 


2010 


100 


0 . 


o 


2 


P_AAS46103 


Human DNA encoding PRO polypeptide seque 


+ 


2010 


2010 


100 


0. 


0 


3 


P_AAA3 7115 


Human PR01573 (UNQ779) cDNA sequence SEQ 


+ 


2010 


2010 


100 


0. 


0 


4 


AX092388 


Sequence 119 from Patent WO0116318. DNA 


+ 


2010 


2010 


100 


0. 


0 


5 


AP000884 


Homo sapiens genomic DNA, chromosome 21q 




2006 


2009 


100 


0. 


0 


6 


AP001707 


Homo sapiens genomic DNA, chromosome 21q 




2006 


2009 


100 


0. 


0 


7 


HS21C052 


Homo sapiens chromosome 21 segment HS21C 




2006 


2009 


100 


0. 


0 


8 


P_AAH64874 


Human secreted protein cDNA, SEQ ID NO: 


+ 


1990 


2005 


100 


0. 


0 


9 


HSA250711 


Homo sapiens CLDN8 gene for claudin-8. 




1927 


1930 


100 


0. 


0 


10 


P_AAS00668 


Human INTERCEPT 3 09 cDNA. 




1879 


1882 


100 


0. 


0 


11 


P__AAH64876 


Human secreted protein cDNA, SEQ ID NO: 


+ 


1811 


1832 


100 


0. 


0 


12 


P_AAH15597 


Human cDNA sequence SEQ ID NO: 13909. 


+ 


1794 


1831 


99 


0. 


0 


13 


AK022269 


Homo sapiens cDNA FLJ122 07 fis, clone MA 




1794 


1831 


99 


0. 


0 


14 


P_AAH64888 


Human secreted protein cDNA, SEQ ID NO: 


+ 


1677 


1692 


100 


0. 


0 


15 


P_AAH64890 


Human secreted protein cDNA, SEQ ID NO: 


+ 


1662 


1689 


99 


0. 


0 



GenBank (Release 143, aug 2004) 



2010 100 0.0 

P_AAF92117 Human PR01573 cDNA. 010 bp, cDNA, PAT 15-MAY.-2001 
ACCESSION P__AAF92117 

KEYWORDS GENE SEQ; Human; PRO protein; mapping; patent; patentdb (v200420, 

23-SEP-2004).. 
SOURCE Homo sapiens . 

ORGANISM Homo sapiens. 
REFERENCE 1 (bases 1 to 2 010) 

AUTHORS" Eaton, D.L., Filvarof f , E . , Gerritsen, M. E . , Goddard,A. , 
Godowski, P. J. Grimaldi , C . J . , Gurney,A.L., Watanabe, C . K. , 
Wood, W.I. 

TITLE Eighty four nucleic acids encoding PRO polypeptides, useful in 

molecular biology, including use as hybridization probes, and in 
. chromosome and gene mapping . 
JOURNAL Patent: WO200116318-A2 ; Filing Date: 24-AUG-2000; 2000WO-US023328 ; 
Publication Date: 08-MAR-2001; Priority: 01-SEP-1999; 
99WO-US020111. 15-SEP-199.9; 99WO-US021090 . 07-DEC-1999; 
99US-0169495P. 09-DEC-1999; 99US-0170262P . ll-JAN-2000; 
2000US-0175481P. 18-FEB-2000; 2000WO-US004341 . 18-FEB-2000; 
2000WO-US004342 . 22-FEB-2000; 2000WO-US004414 . 01-MAR-2000; 
2000WO-US005601. 03-MAR-2000; 2 000US- 01872 02P . 21-MAR-2000; 
2000US-0191007P. 30-MAR-2000; 2 000WO-US008439 . 25-APR-2000; 
2000US-0159397P. 22-MAY-2000 ; . 2000WO-US014042 . 05-JUN-2000; 
2000US-0209832P; Assignee: (GETH ) GENENTECH INC; Cross Reference: 
WPI; 2001-183260/18. P-PSDB; AAB87585; Patent Format: Claim 2; Fig 
119; 278pp; English. 
COMMENT The present sequence is the coding sequence for a human PRO 

polypeptide (secreted and transmembrane) . The PRO protein, and PRO 
agonists, PRO antagonists or anti-PRO antibodies are useful for 
preparation of a medicament useful in the treatment of a condition 
which is responsive to the PRO protein, agonists, antagonists or 
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COMMENT Sequences AAS45925-AAS46231 represent DNA molecules encoding and PCR 

primers for PRO polypeptides of the invention. The sequences of the 
invention can be used to detect the presence of a tumour in a mammal 
by comparing the level of expression of a PRO polypeptide in a test 
sample of cells from the animal and a control sample of normal 
cells, whereby a higher level of expression in the test sample 
indicates the presence of a tumour in the mammal. Mammals include 
dogs, cats, cattle, horses, sheep, pigs, goats and rabbits but are 



preferably human. The polypeptides can be used to stimulate tumour 
necrosis factor (TNF) alpha release from human blood, when 
contacted with it. A specific polypeptide can be used to stimulate 
the proliferation or differentiation of chondrocyte cells. The PRO 
proteins can be used to determine the presence of tumours and also 
susceptibility to tumour development, particularly adrenal, lung, 
colon, breast, prostate, rectal, cervical, or liver tumours, in 
mammalian subjects. The oligonucleotide probes specific for the PRO 
nucleic acids can be used for genetic analysis of individuals with 
genetic disorders 
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GENENTECH INC; Cross Reference: WPI; 2000-237871/20. P-PSDB; 
AAY99433; Patent Format: Claim 2; Fig 187; 773pp; English. 

COMMENT AAA37022 to AAA37144 encode the new isolated human transmembrane, 

receptor or secreted PRO polypeptides given in AAY99340 to AAY99462 
The transmembrane and receptor PRO proteins can be used for 
screening of potential peptide or small molecule inhibitors of the 
relevant receptor/ ligand interactions. The polypeptides and 
nucleotide sequences encoding then have various industrial 
applications, including uses as pharmaceutical and diagnostic 
agents. AAA37145 to AAA37330 represent PCR primers and 
hybridisation probes used in the isolation of the PRO polypeptides 
from the present invention 
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Location/Qualifiers 

1. .340000 
/organism="Homo sapiens" 
/m61_type=" genomic DNA" 
/db_xref ="taxon: 9606" 
/chromosome= "21" 

<1. .148 
/organ ism=" Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref ="taxon: 9606" 
/chromosome= "21" 
./map="21q22 .1" 
/clone="R355D16, 5' partial" 
/cloneJLib="RPCI-ll BAC library" 
/note= "Accession No. AP000236" 

<1.. 48622 
/organism="Homo sapiens" 
/mol_type= "genomic DNA" 
/db_xref ="taxon: 9606" 
/chromosome= " 2 1 " 
/map="21q22.1" 
/clone="R347N6, 5' partial" 
/clone_JLib="RPCI-ll BAC library" 
/note= "Accession No. AP000390" 

<1. .57177 
/org anism= "Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref ="taxon: 9606" 
/ chromosome = "21" 
/map= "21q22.1" 

/clone="P681K20, 5' partial" 
/clone_JLib="RPCIl,3-5 PAC library" 
/note="Accession No. AF191069" 

26572. .135558 
/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db xref="taxon: 9606" 



/chromosome= "21" 

/map="21q22.1" 

/clone="P131P23" 

/clone_lib="RPCIl, 3-5 PAC library" 
/note= "Accession No. AF190641" 
source 48469 .. 158332 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref="taxon:9606" 
/ chromosome= "21" 
/map="21q22.1" 
/clone="P209K9" 

/clone_lib="RPCIl , 3-5 PAC library" 
/note= "Accession No. AF096876" 
source 156690 .. 294644 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/ db_xr e f = " t axon : 9 6 0 6 " 
/chromosome= "21" 
/map="21q22.1" 
/clone="R463J19" 

/clone_lib="RPCI-ll BAC library" 
/note= "Accession No. AP000884" 

source 292534 . . }340000 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 9606" 
/ chromosome^ "21" 
/map="21q22 .1" 
/clone="R680H4, 3' partial" 
/clone_lib="RPCI-ll BAC library" 
/note= "Access ion No. APO 00885" 

repeat_region 1610 . . 1766 

/note="MER50" 

/rpt_family= " LTR/MER4 - group " 

/ rp t_type =DI S PERSED 
repeat_region 1857.. 2051 

/note="AluSx" 

/ rpt_f ami ly = " S INE/ Alu " 

/rpt_type=DISPERSED 
repeat_region 2054.. 2392 

/note="MER50" 

/ rp t_f ami ly= " LTR/MER4 - group 11 

/rpt_type=DISPERSED 
repeat__region 3327. .3518 

/note="MER102" 

/rpt_family="DNA/MERl_type" 

/ rpt_type =DI S PERSED 
repeat_region complement (3582 . .3694) 

/note="L2" 

/rpt_family=»LINE/L2" 

/rpt_type=DISPERSED 
repeat_region 4580. .4614 

/note=" (TTC)n" 

/ rpt_f ami ly= 11 S impl e_r epea t " 

/ rpt__type=TANDEM 
repeat_region 5743 . . 6052 



/note="AluSq" 
/ rp t_f ami ly= " SINE /Alu " 
/fpt_type=DISPERSED 
repeat_region complement (6134 . .6214) 

/note="MIR" 

/rpt_family=" SINE/MIR" 
/rpt_type=DISPERSED 

repeat_region complement (6299 . .6651) 

/note= "MLT1A1 " 
/rpt_f amily= " LTR/MaLR " 
/ rpt_t ype=DI S PERSED 

repeat_region 7531.. 7839 

/note="AluSc" 
/rpt_f amily= « SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 9145 . . 9173 

/note=" (A)n" 

/rpt_jEamily="Simple_repeat" 
/ rpt_type=TANDEM 
repeat_region complement (10285 . . 10467) 

/note="MIR" 

/ rpt_f ami ly= "SINE/MIR" 
/rpt_type=DISPERSED 

repeat__region 10468.. 10753 

/note="AluJb" 
/rpt_f amily= "SINE/Alu » 
/rpt_type=DISPERSED 

repeat_region 10757 . . 10785 

/note=" (TAAAA) n" 

/ rp t _f ami ly = " S imp 1 e_r epea t " 

/ rpt_type= TANDEM 

repeat_region complement (10786 . . 10847) 

/note="MIR" 

/ rpt_f amily= "SINE /MIR" 
/rpt_type=DISPERSED 

repeat_region 12268. .13078 

/note="LlM4" 
/rpt_f amily= "LINE/LI " 
/rpt_type=DISPERSED 

repeat_region 13293 . . 13411 

/note="LlMEc" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region 13442 . . 13591 

/note="LlMEc" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region 13704. .13750 

/note="L2" 

/rpt_f amily="LINE/L2 " 
/rpt_type=DISPERSED 
repeat_region complement (15714 . . 162 07) 

/ note= " HERVL " 

/rpt_family="LTR/Retroviral 
/rpt_type=DISPERSED 
repeat_region complement (16208 . . 16352) 

/note="AluSg/x" 



/ rp t_f ami ly= "SINE / Alu 11 
/rpt_type=DISPERSED 
repeat_region complement (16355 . . 16896) 

/note =" HERVL" 

/rpt_family= " LTR /Retroviral " 
/rpt_type=DISPERSED 

repeat_region complement (16897 . ,17168) 

/note= f, AluSg" 
/ rpt_f ami ly= " S INE/ Alu " 
/rpt_type=DISPERSED 

repeat_region complement (17169 . .17290) 

/ no t e = 11 HERVL 11 

/rpt_f amily= " LTR/Retroviral " 
/rpt_type=DISPERSED 
repeat_region complement (17288 . .17388) 

/note=" HERVL" 

/rpt_f amily= "LTR/Retroviral" 
/rpt_type=DISPERSED 
repeat_region complement (17405 . .17752) 

/not e= "HERVL" 

/rpt_family= "LTR/Retroviral" 
/rpt_type=DISPERSED 
repeat_region 17776 . . 17856 

/note=" (TA)n" 

/ rp t_f ami ly= " S impl e_repea t " 
/ rpt_type=TANDEM 

repea t_region 17875. .17919 

/note=" (TATG)n" 
/rpt_f amily= " Simple_repeat " 
. / rpt_type= TANDEM 

repeat__region 17948.. 18035 

/note="LlM4" 
/rpt_f ami ly= " LINE/LI " 
/rpt__type=DISPERSED 

repeat_region 18040.. 18580 

/note="LlMB8" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region complement (18692 . . 18805) 

/not e= "HERVL" 

/rpt_f amily= " LTR/Retroviral " 
/rpt_type=DISPERSED 
repeat_region complement (192 87 . .19701) 

/note= n MLT2H " 

/rpt_family="LTR/Retroviral" 
/rpt_type=DISPERSED 

repeat_region 20399. .20763 

/ no t e = 11 MLT 1 A 1 " 
/ rp t_f ami ly = " LTR/MaLR « 
/rpt_type=DISPERSED 

repeat_region complement (20896 . .21252) 

/note="MER69" ' 
/rpt_f amily= " DNA/AcHobo " 
/rpt__type=DISPERSED 

repeat_region complement (21253 . .21393) 

/ no t e= " FLAM_C " 
/ rpt_f ami ly= " S INE/ Alu 11 



repeat_region 



repeat_region 



repeat_region 



repeat_region 



repeat^ region 



repeat_region 



STS 



repeater eg ion 



repeat__region 



repeat_region 



repeater eg ion 



repeat_region 



repeat_region 



repeat_region 



/rpt_type=DISPERSED 

complement (21398 . .21995) 
/note="MER69 !l 
/ rpt_f ami ly= " DNA/AcHobo " 
/rpt_type=DISPERSED 

22187.. 22495 
/note="AluSx" 
/rpt_family="SINE/Alu" 
/rpt_type=DISPERSED 

22496.. 22515 
/note=" (GAAAA) n" 
/rpt_f amily= " Simple_repeat " 
/ rpt_t ype = TANDEM 

23349.. 23361 
/note=" (TAA)n" 
/ rpt_f ami ly = " S imp 1 e_r epe at " 
/ r p t_t ype = TANDEM 

23362. .23559 
/note="AluSg/x" 
/ rpt _f ami ly= " SINE/Alu " 
/rpt_type=DISPERSED 

23560. .23581 
/note=" (TAA)n" 
/rpt_f amily= " Simple_repeat " 
/ rp t_t ype = TANDEM 

23648. .23791 
/ s t andar d_name ="D21S1846" 
/note="Accession No. D42163 
HY21-285aR M 

complement (23846 . .24948) 
/note="MER69" 
/rpt_f amily= " DNA/AcHobo " 
/rpt_type=DISPERSED 

complement (24 949 . .25275) 
/ note= "MER61A" 
/rpt_f ami ly= " LTR/MER4 -group 
/ rpt_type=DI S PERSED 

complement (25279 . .25594) 
/note="MER69 ,! 
/ rpt_f ami ly= " DNA/ AcHobo " 
/rpt_type=DISPERSED 

complement (28100 .. 28223) 
/note="L2 " 

/rpt_f amily="LINE/L2 " 
/rpt_type=DISPERSED 

complement (28403 .. 28727) 
/note="MER33" 

/rpt_f amily= " DNA/MERl_type " 
/rpt_type=DISPERSED 

complement (28823 . .2 8942) 
/note= M L2" 

/rpt_f amily= "LINE/L2 " 
/rpt_type=DISPERSED 

complement (29446 . .29820) 
/ not e = " MLT 1 A2 11 
/rptJEamily="LTR/MaLR" 
/rpt_type=DISPERSED 



repeat_region 29905. .30078 

/ not e = " MER5 B " 

/ rp t_f ami ly = " DNA/MER1 _type " 
/rpt_type=DISPERSED 
repeat_region 30446. .30598 

/note= "MER5A" 

/rpt_f amily= " DNA/MERl_type " 
/rpt_type=DISPERSED 

repeat_region complement (31211 . .31515) 

/note="AluSx" 
/ rp t _f am i 1 y = 1 1 S I NE / Al u " 
/rpt_type=DISPERSED 

repeat_regibn 31729.. 31746 

/note=" (TG)n" 

/rpt_f amily= " Simple_repeat " 
/ rpt_type= TANDEM 
repeat_region 31747. .31783 

/note=" (TG)n tt 

/rpt_f amily= "Simple_repeat 11 

/rpt_type=TANDEM 
repeat_regioii 31813 . .31912 

/note=" (CATATA) n" 

/rpt_f amily="Simple_repeat " 

/ rp t_t ype =TANDEM 
repeat_region complement (32487 . .32636) 

/note="LTR27" 

/ rpt_f amily= "LTR/MER4 -group 
/rpt_type=DISPERSED 
repeat_region complement (32694 . .33198) 

/note="LTR27" 

/ rpt_f ami ly = " LTR/MER4 - group 
/rpt_type=DISPERSED 

repeat_region 33408. .33579 

/note^MLTlD" 
/ rp t_f ami ly= " LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region 34013.. 34091 

/note="MIR n 

/rpt_family= "SINE/MIR" 
/rpt_type=DISPERSED 

repeat_region 34093 . . 34277 

/note=» (TCTCC)n" 

/rpt_f amily= "Simple_repeat " 

/ rpt_type= TANDEM 

repeat_region 34611. .34876 

/note="LlMCl" 
/rpt_family= "LINE/LI" 
/rpt_type=DISPERSED 

repeat_region complement (35441 . . 35584) 

/ note= " MER5 8B " 
/rpt_family=''DNA/MERl_type" 
/rpt_type=DISPERSED 

repeat_region complement (35585 .. 35767) 

/note="AluY" 
/ rp t_f ami 1 y = " S I NE / Al u " 
/rpt__type=DISPERSED 

repeat_region complement (35768 . . 35824) 



/ not e= "MER58B " 
/rpt_family= " DNA/MERl_type " 
/rpt_type=DISPERSED 
repeat_region 36150 . . 36200 

/note=" (TA) n" 

/rpt_f amily= "Simple_repeat " 
/ rpt_type= TANDEM 

repeat_region complement (37283 . .37594) 

/note="AluY" 
/rpt_family=" SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 38551. .38618 

/note=" (TTC)n" 
/rpt_f amily= " Simple_repeat " 
/rpt_type=TANDEM 

repeat_region 39174. .39302 

/note="LlMD3" 
/ rp t_f ami ly = " L I NE / L 1 " 
/rpt_type=DISPERSED 

repeat_region complement (39345 . .3 9510) 

/note="LlM4" 
/rpt_family= n LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region 39512 . .39717 

/note="AluY" 
/rpt_f amily= » SINE/Alu " 
/rpt_type=DISPERSED 

repeat_region 39718. .40146 

/note="LTR7 ! * 

/ r P t _£ ami 1 y = " LTR / Re t r o v i r a 1 
/rpt_type=DISPERSED 

repeat_region 40148 . .40244 

/note="AluY" 
/rpt_f amily= " SINE/Alu " 
/rpt_type=DISPERSED 

repeat_region complement (40247 . .40461) 

/note="L!M4" 
/rpt_f amily= "LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (40647. .40885) 

/note= n LlMEc ,! 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region complement (40946 . .42900) 

/note="LlM4" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region 44069. .44189 

/note="AluJb" 
/rpt_family= "SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region . 44190. .44212 

/note=" (CAT) n" 
/ rpt_f amily= " Simple_repeat " 
/ rpt_type= TANDEM 

repeat_region 44215. .44256 

/note="BC200" 



/rpt_f amily= " scRNA" 
/rpt_type=DI SPERSED 
repeat_region 45825. .45891 

/note=" (TG) n" 

/rpt_f amily="Simple_repeat" 
/ rpt_type= TANDEM 
repeat__region 47490.. 48123 

/ no t e = " MER4 IB" 

/ rp t_f ami 1 y= " LTR/MER4 - group " 
/rpt_type=DISPERSED 
repeat_region complement (48305 . .48393) 

/ note= "MER5A" 

/rpt_f ami ly= " DNA/MERl_jtype 11 
/rpt_type=DISPERSED 

repeat_region complement (48394 . .48702) 

/note="AluSg" 
/ rpt_f ami ly= » S INE / Alu " 
/rpt_type=DISPERSED 

repeat_region complement (48703 . .48766) 

/note="MER5A" 

/rpt_family="DNA/MERl_type " 
/rpt_type=DISPERSED 
repeat_region 48800.. 49452 

/note="PABL_A" 

/ rpt_f ami ly= " LTR/MER4 - group " 
/rpt_type=DISPERSED 
repeat_region 49471. .49621 

/note= "MER5B" 

/ rpt_f ami ly= " DNA/MERl_type " 
/ rp t_type=DI SPERSED 
repeat_region complement (49915 . .50054) 

/note="MIR" 

/ rpt_f amily= "SINE /MIR" 
/rpt_type=DISPERSED 
repeat_region 50849 . . 51045 

/note="MIR" 

/ rp t_f ami ly = " S INE /MI R " 
/rpt_type=DI SPERSED 

repeat_region complement (52528 . .52769) 

/not e=" Char liela" 
/ rp t_f ami ly= " DNA/MER l_t ype " 
/rpt_type=DISPERSED 

repeat_region complement (52777 . .53603) 

/note="LlPA13" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat__region complement (53604 . .53831) 

/note="Charliela" 
/ rp t_f ami ly = " DNA/MER1 J: ype " 
/rpt_type=DISPERSED 

repeat_region 53832 . . 54122 

/note="AluSg" 
/rpt_family="SINE/Alu" 
/rpt_type=DI SPERSED 

repeat_jregion complement (54123 . .55116) 

/note="Charliela" 
/ rpt_f ami ly= " DNA/MERl_type " 



/rpt__type=DISPERSED 

repeat_region complement (55804 . .55958) 

/note="LlMA2" 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (56025 . .56407) 

/note="THElC" 
/rpt_family="LTR/MaLR" 
/rpt_type=DISPERSED 

repeat_region 56596 . . 56615 

/note=" (TTG)n" 
/ r P t _f ami ly = " S imp 1 e_r epe at " 
/ rpt_type= TANDEM 

repeat_region 56906. .57196 

/note= n AluSg" 
/rpt_f amily= "SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 57276. .57585 

/note="AluSx" 
/ rp t _f ami ly = " S INE / Alu " 
/rpt_type=DISPERSED 

repeat_region complement (57919 . * 58257) 

/note="LlPA14" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region complement (59538 . .59846) 

/note="AluY" 
/rpt_f amily= " SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 60284. .60561 

/note="L2" 

/rpt_family="LINE/L2 " 
/rpt_type=DISPERSED 
repeat_region 60878 . . 61021 

/note="MER5A" 

/ rp t_f ami ly= » DNA/MERl_type " 
/rpt_type=DISPERSED 

repeat_region 61022.. 61304 

/note="AluSp" 
/ rp t_f ami ly = "SINE /Alu " 
/rpt_type-DISPERSED 

repeat_region 61305. .61359 

/note=" (TAAAA)n" 

/rpt_f amily= "Simple_repeat " 

/rpt_type=TANDEM 

repeat_region 61361 . .61378 

/note=" MER5A" 

/rpt_f amily= « DNA/MERl_type " 
/rpt_type=DISPERSED 
repeat_region complement (62037 . .62224) 

/note="L2" 

/rpt_family= "LINE/L2 " 
/rpt_type=DISPERSED 
repeat_region 62593 . . 62642 

/note="L2" 

/rpt_family="LINE/L2" 
/rpt_type=DISPERSED 



repeat_region 64995 . . 65340 

/ no t e = " L 1ME 3 A " 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeater eg ion complement (65614 . .66155) 

/note="LlM4 " 
/rpt_f amily= "LINE/LI » 
/rpt_type=DISPERSED 

repeat_region complement (66210 . .66282) 

/note="LlME" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region 66285. .66307 

/note=" (TTTTA) n" 

/rpt_f amily= " Simple_repeat " 

/ rpt__type= TANDEM 

repeat_jregion complement (66308 . .66463) 

/note="AluSg/x" 
/rpt_f amily= " SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region complement (66465 . . 67226) 

/ note — " LIME " 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region 68693 . . 68831 

/ not e= " MER1 04 " 
/rpt_family= " DNA/MERl_type " 
/rpt_type=DISPERSED 

repeat_region complement (70650 . . 70847) 

/note="AluY" 
/rpt_f amily= "SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region complement ( 72537 . .72899) 

/ note= "MLT1A1 " 
/ rp t_f ami ly = " LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region 73742.. 74087 

/ not e= " THE1B " 
/rpt_family= "LTR/MaLR" • 
/rpt_type=DISPERSED 

repeat_region 75666 . . 76000 

/ not e= " THE1B " 
/rpt_f amily= " LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region 76064.. 76534 

/note="MLTlD" 
/ rpt_f ami ly= " LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region complement (77488 . . 77640) 

/note="AluJo n 
/ rpt_f ami ly = " S INE/Alu " 
/rpt_type=DISPERSED 

repeat_region complement (78204 . . 78675) 

/note="MLTlH" 
/ rp t_f ami ly = " LTR/MaLR " 
/ rpt_type=DISPERSED 

repeat_region 79704. .79724 



/note= " (TG) n" 

/ rp t_f ami ly = " S impl e_repeat " 
/ rpt_type= TANDEM 

repeat_region complement (79859 . .80105) 

/note="AluSg" 
/rpt__f amily= " SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 81177. .81331 

/note="MER53" 
/rpt_f amily= " DNA" 
/rpt_type=DISPERSED 

repeat_region 82207.. 82702 

/note="MER66B" 

/rpt_f ami ly= " LTR/MER4 -group " 
/rpt_type=DISPERSED 

repeat_region 84291 . . 84769 

/ note= H LIMEc " 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region 84895.. 86159 

/note="LlMEc" 
/rpt_family= ! 'LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region complement (86173 . .86603) 

/note="MLTlC" 
/rpt_f amily= " LTR/MaLR" 
/ rp t_t ype=DI S PERSED 

repeat_region 87059.. 87083 

/note=" (TG) n" . 
/rpt_family="Simple_repeat" 
/rpt_type=TANDEM 

repeat_region 88474.. 88507 

/note=" (TC)n 1 ' 

/rpt_f amily= " Simple_repeat " 
/rpt_type=TANDEM 
repeat_region 88508.. 8 85 27 

/note=" (CA)n" 

/rpt_f amily="Simple_repeat " 
/rpt_type=TANDEM 

repeat_region 88596 . . 89068 

/ note= "MLT1F1 " 
/ rp t_f ami ly = " LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region 89598.. 89648 

/note=" (CATA)n" 
/ rpt__f amily= " Simple__repeat " 
/rpt_type=TANDEM 

repeat_region 91581 . . 92953 

/note="LlPA8» 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeat__region 93138. .93279 

/note= M L2 n 

/rpt_f amily= » LINE/L2 " 
/rpt_type=DISPERSED 
repeat_region complement (94478 . .94637) 

/note="MER5A" 



/rpt_f amily= " DNA/MERl_type " 
/rpt_type=DISPERSED 
repeat_region 94800. .94858 

/note="Ll" 

/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat__region 94855 . . 95590 

/note="LlPA16 M 
/rpt_f ami ly= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (95618 . . 95915) 

/note="L2" 

/rpt_family="LINE/L2" 
/ rpt_t ype=DI S PERSED 

STS 96719. .97040 

/standard_name="D21S93" 
/db_xref = " GDB : 1 92 0 0 3 " 

repeat_region 97850.. 97997 

/note= "MER5A" 

/rpt_family= » DNA/MERl_type " 
/rpt_type=DISPERSED 
repeater egion 98340. .98603 

/note="L2" 

/rpt_family="LINE/L2" 

/rpt_type=DISPERSED 
repeat_region 100248 .. 100273 

/note=" (TCTA) n" 

/rpt_f amily= " Simple_repeat " 

/ rpt_type = TANDEM 
repeat_region 101340 . .101376 

/note=" (CA)n" 

/rpt_f amily= " Simple_repeat " 
/ rpt_type = TANDEM 

repeat_region 101803 . . 101903 

/note="MADEl" 
/rpt__f amily= " DNA/Mar iner " 
/rpt_type=DISPERSED 

repeat_regioh complement (104577 . .104851) 

/note="AluY" 
/ rpt_f ami ly = "SINE / Alu " 
/rpt_type=DISPERSED 

repeat_region 104868 .. 104994 

/not e=" Alu Jo" 
/rpt_f amily= " SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region complement (104995 . .105295) 

/note="AluSg" 
/rpt_f ami ly= " S INE/Alu " 
/rpt_type=DISPERSED 

repeat_region 105296 .. 105496 

/note="AluJo" 
/rpt_family=" SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 106248 . . 106271 

/note=" (CA)n" 

/rpt_f amily= " Simple_repeat " 
/ rp t _t ype =TANDEM 



repeat_region complement (106998 . .107232) 

/note="LlM4 " 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (107236 .. 108368) 

/note="LlMB8" 
/rpt_f ami ly= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region 110671 . . 110915 

/note="MIR" 

/ rpt_f amily= "SINE/MIR" 
/rpt_type=DISPERSED 

repeat_region complement (111090 . .111456) 

/note="MLTlB" 
/rpt__f amily= " LTR/MaLR" 
/rpt_type=DISPERSED 

repeat_region 116374 . . 116397 

/note=" (GGAA)n" 
/rpt_f amily= " Simple_repeat " 
/ rpt_type=TANDEM . 

repeat_region 117041. .117786 

/note="L2" 

/rpt_family="LINE/L2 " 
/rpt_type=DISPERSED 

repeat_region 117922 . . 117945 

/note=" (TTTTG) n" 
/rpt_f amily= "Simple_repeat " 
/ rpt_type=TANDEM 

repeat_region complement ( 117949 . .118210) 

/note="AluSg" 
/rpt_family= " SINE/Alu " 
/ rpt_type=DI SPERSED 

repeat_region 118329 . . 118723 

/note="L2 " 

/rpt_f amily= "LINE/L2 " 
/rpt_type=DISPERSED 
repeat_region complement (119645 . .119734) 

/note= n L2" 

/rpt_f amily="LINE/L2 11 
/rpt_type=DISPERSED 

repeat_region complement (122484 . .122818) 

/note="LlMAl" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region complement (122 910 . .123004) 

/note="MER112" 
/rpt_f amily= » DNA/MERl_type " 
/rpt_type=DISPERSED 

repeat_region 123306. .123609 

/note="AluSg M 
/ rp t_f ami ly= " S INE/Alu " 
/ rpt _typ'e=DI SPERSED 

repeat_region 123733 . . 123874 

/note="MER103" 
/ rpt_f ami ly= » DNA » 
/rpt_type=DISPERSED 

repeat_region complement (124583 . . 124659) 



/ note= "MER5A" 

/ rp t_f ami ly= " DNA/MERl_type " 
/rpt_type=DISPERSED 
repeat_region complement (125311 . .125442) 

/note= n MIR n 

/ rp t_f ami ly= "SINE/MIR" 
/rpt_type=DISPERSED 

repeat_region 126521 . . 126969 

/note="LlM4 " 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (127168 .. 127470) 

/note="AluSx" 
/rpt_family="SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 127973 . . 128249 

/note="AluSx n 
/rpt_f amily= " SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 128250 . . 128276 

/note=" (CAAAA) n" 
/rpt_family= "Simple_repeat " 
/ rpt_type= TANDEM 

repeat_region complement (12 8361 . .129106) 

/note=" LTR1B" 

/rpt_family= "LTR/Retroviral" 
/rpt_type=DISPERSED 
repeat_region 12 9465 . . 12 9551 

/note= "MER5A" 

/rpt_f amily= " DNA/MERl_type " 
/rpt_type=DISPERSED 

repeat_r egion complement (13 0876 . .130938) 

/note="LlPB2" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region complement (130940 . ,131303) 

/note="THElC" 
/rpt_family="LTR/MaLR" 
/rpt_type=DISPERSED 

repeat^region complement (131311 . .132911). 

/note= " THE1C- internal " 
/ rp t_f ami ly = " LTR/MaLR « 
/rpt_type=DISPERSED 

repeat_region complement (132914 . .133274) 

/ note= " THE1C" 
/rpt_f amily= "LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region complement (133282 . .134013) 

/note="LlPB2" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region 134012 .. 134613 

/note="LlPB2" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

STS 135457. .135593 

/ s t anda r d_name ="D21S1695" 



/note="232C7aL 
Accession No. D38269" 

repeat_region complement (136167 .. 136278) 

/note="MLTH" 
/ rp t_f ami ly= » LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region complement (138259 .. 139869) 

/note="L2 " 

/rpt_f amily="LINE/L2 11 
/rpt_type=DISPERSED 

repeat_region complement (139966 . .140245) 

/not e=" Alu Jo" 
/rpt_f amily= " SINE/Alu " 
/rpt_type=DISPERSED 

repeat_region complement (140496 . .141823) 

/note="L2" 

/ rp tJE ami ly = " L INE /L2 " 
/rpt_type=DISPERSED 
repeat_region 142802 . . 142991 

/ not e = " MER4 1A " 

/ rpt_f amily= " LTR/MER4 -group " 
/rpt_type=DISPERSED 

repeat_region 142992 . .143282 

/note="AluSc" 
/ rp t_f ami ly= " S INE/ Alu " 
/rpt_type=DISPERSED 

repeat_regiori 143283 . . 143655 

/note= "MER41A" 

/rpt_f amily= "LTR/MER4 -group " 
/rpt_type=DISPERSED 
repeatjregion 143854 .. 144372 

/note= "MER4B" 

/ f p t_f ami ly = " LTR/MER4 - group " 
/rpt_type=DISPERSED 

repeat_region complement (144887 . .145196) 

/note="AluY" 
/rpt_f amily= » SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region complement (145960 . . 146456) 

/note= H LTR19A" 
/ rpt_f ami ly= " LTR " 
/rpt_type=DISPERSED 

repeat_region complement (148453 . .148757) 

/not e=" Alu Jo 11 
/rpt_family= "SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region complement (149220 . . 149721) 

/ note= "MLT1F" 
/rpt_f amily= "LTR/MaLR" 
/rpt_type=DISPERSED 

repeat_region complement (149774 . . 149906) 

/ not e = 11 FLAM_C " 
/ rp t_f ami 1 y = " S INE / Alu " 
/rpt_type=DISPERSED 

repeat_region complement (150819 . .151269) 

/note="L2 " 

/rpt_family="LINE/L2 " 



/rpt_type=DISPERSED 
repeat_region complement (151444 . .151618) 

/note="L2 l » 

/rpt_family="LINE/L2" 
/rpt_type=DISPERSED 
repeat_region 151947. .152069 

/note="L2" 

/rpt_family="LINE/L2" 
/rpt_type=DISPERSED 
repeat_jregion complement (152112 . .1522 05) 

/note="L2" 

/rpt_family="LINE/L2 n 

/rpt_type=DISPERSED 
repeat_region 152215 . . 152383 

/note=" (TCTA)n" 

/rpt_f amily= " Simple_repeat " 

/ rpt_type= TANDEM 
repeat_region complement (153 794 . .153962) 

/ note= "MER5A" 

/rpt_f ami ly= " DNA/MERl_type 11 
/rpt_type=DISPERSED 

repeat_region complement (154253 . .154624) 

/note="LlM2" 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (155818 . .156174) 

/note= "THE1B" 
/ rp t_f ami 1 y = " LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region complement (156354 . .156451) 

/note="AluYb8" 
/rpt_family="SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region complement (156453 . .156484) 

/ note= "MLT1H " 
/rpt_family= "LTR/MaLR" 
/rpt_type=DISPERSED 

repeat_region 157012 .. 157102 

/nbte="MER94" 
/ rpt_f ami ly= » DNA " 
/rpt_type=DISPERSED 

repeat_region complement (157361 . .157467) 

/note="MIR" 

/ rpt__f amily= "SINE/MIR" 
/rpt_type=DISPERSED 
repeat_region 157517 . . 157596 

/note="MIR" 

/ r p t _f ami ly=" SINE/MIR" 
/rpt_type=DISPERSED 

repeat_region 157618 157705 

/note=" (TATATG) n" 
/rpt_f amily= " Simple_repeat " 
/ rpt_type= TANDEM 

repeat_region complement (159423 . . 159610) 

/ note= " LTR3 3A " 

/rpt_family= " LTR/Retroviral" 
/rpt_type=DISPERSED 



repeat_region 159865 .. 159943 

/note="MER97c" 

/ rp t_f ami ly= 11 DNA/MERl_type ? " 

/rpt_type=DISPERSED 
repeat_region 160006 . . 160108 

/note="MER97a" . 

/rpt_f amily="DNA/MERl_type? " 

/rpt_type=DISPERSED 
repeat_region 160309 .. 160470 

/note=" (TCTA)n" 

/ rp t_f ami ly = " S imp 1 e_r epea t " 

/ rpt__type=TANDEM 
repeat_region complement (160471 . . 160878) 

/ not e= "MLT1B " 

/ rp t_f ami ly= " LTR/MaLR " 

/rpt_type=DISPERSED 
repeat_region complement (160891 . .161131) 

/note="MER97a" 

/rpt_family="DNA/MERl_type?" 
/rpt_type=DISPERSED 
repeat_region 161890 162041 

/note="MIR" 

/ rpt_f amily= "SINE/MIR" 
/rpt_type=DISPERSED 
repeat_region complement (162554 . .162614) 

/note="MER5B" 

/rpt_family="DNA/MERl_type" 

/rpt_type=DISPERSED 
gene complement (164968 .. 165642) 

/gene="CLDN17" 
mRNA complement (164968. .165642) 

/gene="CLDN17" 

/note= "human gene for claudin-17, Accession No. AJ250712" 
CDS complement (164968. .165642) 

/gene= ,, CLDN17" 

/note= "human gene for claudin-17, Accession No. AJ250712" 

/codon_start=l 

/protein_id="BAA95566.1" 

/db_xref="GI: 7768787" 
exon complement (164968 . .165642) 

/gene="CLDN17" 

/number =1 
repeat_region 167617 . . 167889 

/note="AluJb" 

/rpt_family="SINE/Alu" 

/rpt_type=DISPERSED 
repeat_region 168032 . . 168186 

/note="L2" 

/rpt_family= ,f LINE/L2 " 

/rpt_type=DISPERSED 
repeat_region 168195 .. 168527 

/note="LlPA16" 

/rpt_family="LINE/Ll" 

/rpt_type=DISPERSED 
repeat_region complement (168528 . . 168606) 

/note="AluSc n 

/rpt_family="SINE/Alu" 



/ rpt_type=DI SPERSED 
repeat_region 168648 .. 168679 

/note=" (TAAAA)n" 

/ rpt_f amily= " Simple_repeat " 

/ rp t_t ype = TANDEM 
repeat_region .168928. .169108 

/note="MIR" 

/rpt_family="SINE/MIR« 
/rpt_type=DISPERSED 
repeat_region 169447 . . 169805 

/note= " LTR16A" 

/ rpt_f amily= 11 LTR/ Retroviral 11 
/rpt_type=DISPERSED 
repeat_region 169861 170088 

/note="L2 " 

/rpt_f amily="LINE/L2 " 
/rpt_type=DISPERSED 
repeat_region complement (170145 . .170646) 

/note="L2" 

/rpt_f ami ly=" LINE /L2 " 
/rpt_type=DISPERSED 

repeat_region 170647 . . 170948 

/note="AluSg[ I! 
/ rpt_f ami ly= " S INE / Alu " 
/rpt_type=DISPERSED 

repeat_region complement (170949 . .171405) 

/note="L2 " 

/rpt_f amily= " LINE/L2 " 

/rpt^type=DISPERSED 
repeat_region 171759 171785 

/note= n (TTTG)n" 

/ rp t_f ami ly = " S impl e_r epe a t " 

/ rp t_t ype = TANDEM 
repeat_region complement (171786 . .172071) 

/note="AluSx" 
. /rpt^family^'SINE/Alu" 

/rpt_type=DISPERSED 
repeat_region 172208 . . 172233 

/note=" (CA)n" 

/rpt_f amily= "Simple__repeat " 
/ rpt_t ype =TANDEM 
repeat__region 172349 . . 172598 

/note="L2 " 

/rptjE amily="LINE/L2 11 
/rpt_type=DISPERSED 
repeat_region 172646. .172710 

/note="L2 ™ 

/rpt_f amily= « LINE/L2 " 
/rpt_type=DISPERSED 
repeat_region 173181 . . 173215 

/note=" (T)n" 

/rpt__family= ,, Simple_repeat n 
/ rp t_t ype = TANDEM 
repeat_region 173639. .173767 

/note="MER5B" 

/rpt_f amily= " DNA/MERl_type " 
/rpt_type=DISPERSED 



repeat_region 173799 . . 173859 

/note="MER5B" 

/ rpt_f ami ly= " DNA/MERl_type " 
/rpt_type=DISPERSED 
repeat_region 173877 .. 174053 

/note="MER5B" 

/rpt_family= " DNA/MERl_type 11 
/rpt_type=DISPERSED 

repeat_region complement (175682 . .175736) 

/note="LlMC3" 
/ rp t_f amily= "LINE /LI " 
/rpt_type=DISPERSED 

repeat_region complement (175737 . .176033) 

/note="AluSx" 
/rpt_f amily= » SINE/Alu" 
/rpt__type=DISPERSED 

repeat_region complement (176034 . .176617) 

/note="LlMC3 " 
/ rp t_f ami ly = " LINE /LI " 
/rpt_type=DISPERSED 

repeat_region complement (176621 . .176802) 

/note="LlMC3" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region complement (176803 . .177103) 

/note="AluJb" 
/ rp t_f ami ly = » S I NE / Alu " 
/rpt_type=DISPERSED 

repeat_region complement (1771 04.. 17 72 19) 

/note="LlMC3" 
/rpt_f ami ly= » LINE/LI » 
/rpt_type=DISPERSED 

repeat_region 178433 . . 178794 

/note="MLTlD" 
/ rp t_f ami ly = » LTR /MaLR " 
/rpt_type=DISPERSED 

repeat_region 178805 . . 179028 

/note="L2" 

/rpt_f amily="LINE/L2 " 
/ rpt_t ype =DI SPERSED 
repeat_region complement (179612 . .179652) 

/note="MIR" 

/ rpt__f amily= "SINE /MIR" 
/rpt_type=DISPERSED 
repeat_region 179802 . . 179976 

/note="MER2 0" 

/ rp t_f ami ly= " DNA/MERl_t ype " 
/rpt_type=DISPERSED 
repeat_region complement (179979 . .180204) 

/note="MIR" 

/rpt_family= "SINE/MIR" 
/rpt_type=DISPERSED 
repeatjregion complement (181241 . .181414) 

/note="MIR" 

/ rpt__f amily= "SINE/MIR" 
/rpt_type=DISPERSED 
repeat_jregion 181545 . . 181591 



/ no t e = " MER2 OB" 
/rpt_f amily= " DNA/MERl_type " 
/rpt_type=DISPERSED 
repeat_region 181592 . . 181632 

/note=" (TG)n ,f 

/rpt_f amily= " Simple_repeat " 

/rpt_type=TANDEM 
repeat_region 181633. .182293 

/note="MER2 0B n 

/rpt_family= " DNA/MERl_type " 

/rpt_type=DISPERSED 
repeat_region complement (182376 . .182500) 

/note="L2" 

/rpt_family="LINE/L2 " 
/rpt_type=DISPERSED 
repeat_region 182510 . . 182779 

/ not e= 11 LTR3 7B " 

/rpt_f amily= "LTR/MER4 -group" 
/rpt_type=DISPERSED 
repeat_region 182818 . .182 897 

/not e= " LTR3 7B " 

/rpt_family="LTR/MER4- group" 
/rpt_type=DISPERSED 

repeat_region 183294 . . 183587 

/note="AluSq" 
/ rp t_f ami ly = " S INE / Alu " 
/rpt_type=DISPERSED 

repeat_region 184702 . . 184877 

/note="HALl" 

/rpt_f amily= » LINE/Other ■ 
/rpt_type=DISPERSED . 

repeat_region 185141 185593 

/note= "MLT1D" 
/rpt_family="LTR/MaLR" 
/rpt_type=DISPERSED 

repeat_region 186761 .. 186852 

/note="LlPA8" 
/rpt_f amily= "LINE/LI " 
/rpt_type=DISPERSED 

repeat_region 188430 . . 188561 

/note= f, LlPA15" 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (188895 . .189066) 

/note="MLTlJ" 
/rpt_family="LTR/MaLR" 
/rpt_type=DISPERSED 

repeat_region complement (189265 . .189339) 

/note="MLTU" 
/rpt_family= "LTR/MaLR" 
/rpt_type=DISPERSED 

repeat_region 189795. .190352 

/note="L2" 

/rpt_f amily= "LINE/L2 » 
/rpt_type=DISPERSED 
repeat_region complement (190366 . .190506) 

/note= "MER5B " 



/rpt_f amily= "DNA/MERl_type" 
/rpt_type=DISPERSED 
repeat_region complement (190558 . . 190629) 

/ no t e =." MER4 A2 » 

/ rpt_f ami ly= " LTR/MER4 - group " 
/rpt_type=DISPERSED 
repeat_region complement (190625 . .190745) 

/ note= "MER4A" 

/rpt_family= "LTR/MER4 -group" 
/rpt_type=DISPERSED 

repeat_region complement (190777. .191146) 

/note= "MLT1D" 
/rpt_f amily= " LTR/MaLR" 
/rpt_type=DISPERSED 

repeat_region complement (191227 . .191346) 

/note="MLTlD" 
/ rpt_f ami ly= » LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region 191353 . . 192586 

/note="L2" 

/rpt_family="LINE/L2" 
/rpt_type=DISPERSED 

repeat_region complement (192679 . .193267) 

/note="MER89" 
/rpt_family="LTR" 
/rpt_type=DISPERSED 

repeat_region 193396 . .193432 

/note="L2" 

/ rp t_f ami ly = " LINE / L2 " 
/rpt_type=DISPERSED 
repeat_region complement (19422 8 . .194449) 

/note="MIR" 

/ rpt_f amily= "SINE /MIR" 
/rpt_type=DISPERSED 
repeat_region 194909 .. 194951 

/note="L2" 

/rpt_family= "LINE/L2 11 
/ rp t_type =D I S PERSED 
repeat_region 195010. .195029 

/note=" (TG) n" 

/rpt_f amily= " Simple_repeat " 
/ rpt_type=TANDEM 
repeat_region 196293 . . 196403 

/note="MIR» 

/ rpt__f ami ly= 11 SINE /MIR " 
/rpt_type=DISPERSED 

repeat_region 197062 . . 197425 

/note="LlMA5 " 
/rpt_f amily= "LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (197429 . . 197480) 

/note="LlP» 
/rpt_f amily= "LINE/LI " 
/rpt_type=DISPERSED 

repeat_region 197479 .. 198709 

/note= f, LlP" 
/rpt_f amily= " LINE/LI " 



/rpt_type=DISPERSED 

repeat_region 198710 . . 198984 

/note="LlPA7 « 
/rpt_f amily= "LINE/LI " 
/rpt_type=DISPERSED 

repeat_region 198999 . . 199018 

/note=" (TAAAA)n" 
/rpt_family="Simple_repeat" 
/ rpt_type= TANDEM 

repeat_region complement (199526. .200246) 

/note="LlPA7 » 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (200247 . .201488) 

/note="LlP» 
/rpt_f amily= "LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (201590 . .201653) 

/note="MIR" 

/ rpt_f amily= "SINE/MIR" 
. /rpt_type=DISPERSED 

repeat_region 202619 . .202926 

/not e=" Alu Jo" 
/rpt_f amily= « SINE/Alu" 
/rpt_type=DISPERSED 

repeatjregion complement ( 2 0 3 13 1 . . 2 0 3264 ) 

/note= M FLAM_C " 
/ rp t _f ami 1 y = " S INE / Alu " 
/ rpt_type=DI SPERSED 

repeatjregion 203305. .203433 

/ not e = " FLAM_A " 
/ rpt_f ami ly= " S INE/Alu " 
/rpt_type=DISPERSED 

repeat_region 203607. .203646 

/note=" «3A)n" 

/ rpt_f amily= 11 Simple_repeat " 
/ rpt_type=TANDEM 
repeat_region complement (203963 . .204053) 

/note="MER67C" 

/rpt_f amily= "LTR/MER4 -group" 
/rpt_type=DISPERSED 

repeat_region complement (204054 . .204361) 

/note="AluSx" 
/rpt_f amily= » SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region complement (2 04362 . .204813) 

/note="MER67C" 

/ rpt_f ami ly = " LTR/MER4 -group " 
/rpt_type=DISPERSED 

repeat_region complement (204820 . .205121) 

/note="AluSx" 
/ rp t_f ami ly = "SINE /Alu " 
/rpt_type=DISPERSED 

repeat_region 205193 . . 205219 

/note=" (TTG)n" 
/rpt_f amily= " Simple_repeat " 
/ rpt_type= TANDEM 



repeat_region complement (205225 . .205419) 

/note="LlPA4" 

/rpt_f amily= "LINE/LI " 

/rpt_type=DISPERSED 
repeat_region 205492 . .205569 

/ note= "MLT1G1 " 

/rpt_family="LTR/MaLR" 

/rpt_type=DISPERSED 
repeat_region 205751 .. 205854 

/note= "MLT1H" 

/ rp t _f ami ly = " LTR/MaLR " 

/rpt_type=DISPERSED 
repeat_region complement (2 05868 . .2 06401) 

/note= " LTR26E 11 

/ rp t_f ami ly = " LTR/Re t rovi ral " 

/rpt_type=DISPERSED 
repeat_region 207212 .. 207472 

/note="AluJo» 

/rpt_family="SINE/Alu" 

/rpt_type=DISPERSED 
repeat_region 207480 . ,207656 

/note=." (GAAA)n" 

/rpt_family="Simple_repeat " 

/ rp t_t ype = TANDEM 
repeat_region 209242 .. 209271 

/note=" (TA)n" 

/rpt_f ami ly= " Simp le_repeat " 
/ rp t__t ype = TANDEM 
repeat_region 209299 .. 209466 

/note=" (TA)n" 

/rpt_f ami ly= " S impl e_repeat 11 
/ rpt_type=TANDEM 
repeat_region 209490 .. 209519 

/note= " (TA) n" 

/ rpt_f ami ly= " S impl e_repeat " 
/ rpt_type=TANDEM 
repeat_region 209588. .209619 

/note=" (TA) n" 

/rpt_f amily= "Simple_repeat 11 
/ rpt_type=TANDEM 
repeat_region complement (210077 . . 210309) 

/note="MIR" 

/rpt_family="SINE/MIR" 
/ rpt_type =D I S PERSED 
repeater eg ion compl ement (211664. .211851) 

/note="L2" 

/ rpt_f amily= " LINE/L2 " 

/rpt_type=DISPERSED 
repeat_region 212184 .. 212355 

/note="AluSg/x n 

/rpt_f amily= " SINE/Alu" 

/rpt_type=DISPERSED 
gene complement (213042 . .214972) 

/gene="CLDN8" 
mRNA complement (213042. .214972) 

/gene="CLDN8" 

/note="human gene for claudin-8, Accession No. AJ250711" 



exon complement (213042 . .214972) 

/gene="CLDN8" 
/number=l 

5'UTR complement (213042 . .214272) 

/gene="CLDN8" 
polyAjsignal complement (21.3 055 . .213060) 

/gene="CLDN8" 
CDS complement (214273 . .214950) 

/gene="CLDN8" 

/note="human gene for claudin-8, Accession No. AJ250711" 

/ codon_s t ar t = 1 

/pro t e in_id= " BAA9 5 5 6 7 . 1 " 

/db_xref="GI: 7768788" , 
5'UTR complement (214951. .214972) 

/gene="CLDN8 " 
repeat_region 216645. .216952 

/note="AluY" 

/rpt_family="SINE/Alu" 

/rpt_type=DISPERSED 
repeat_region complement (218710 ... 219021) 

/note="AluSx" 

/ rp t_f ami ly= " S INE/Alu " 

/rpt_type=DISPERSED 
repeat__region complement (219103 . . 219477) 

/note="MLTlA2 " 

/ rp t_f ami ly= " LTR/MaLR " 

/rpt_type=DISPERSED 
repeat_region complement (219991 . .220158) 

/note» "MIR" 

/ rpt_f amily= "SINE/MIR" 
/rpt_type=DISPERSED 
repeat_region complement (221295. .221472) 

/note="MIR" 

/ rpt_f amily= "SINE/MIR " 
/rpt_type=DISPERSED 
repeat_region 221763 .. 221790 

/note-" (CA)n" 

/rpt_f amily= " Simple__repeat " 
/rpt_type= TANDEM 
repeat_region complement (222287 .. 222349) 

/note="L2" 

/rpt_family="LINE/L2 " 
/rpt_type=DISPERSED 
repeat_region 223032 .. 223125 

/note="L2" 

/rpt_family="LINE/L2" 
/rpt_type=DISPERSED 
repeat_region 223233 . .223386 

/note="L2" 

/ rpt_f amily= "LINE/L2 " 
/rpt_type=DISPERSED 
repeat_region complement (225324 .. 225394) 

/note=»L2" 

/rpt__family="LINE/L2" 
/rpt_type=DISPERSED 
repeat_region 225395 . .225716 

/note="AluSx" 



/rpt_family= "SINE/Alu" 
/rpt_type=DISPERSED 
repeat_region complement (225717 . .225791) 

/note="L2" 

/rpt_family= "LINE/L2 " 
/rpt_type=DISPERSED 

repeat_region 225866. .226083 

/note="MER102" 
/rpt_f amily= " DNA / MER l_t ype n 
/rpt_type=DISPERSED 

repeat_region 227539. .227832 

/note="AluJb" 
/ rp t_f ami ly = "SINE / Alu " 
/rpt_type=DISPERSED 

repeat_region 227864 227891 

/note=" (A)n n 

/rpt_f amily= " Simple_repeat " 
/ rpt_type=TANDEM 
repeat_region 228396. .228475 

/note=" (TA) n" 

/rpt_f amily= " Simple_repeat " 
/ rpt_type= TANDEM 

repeatjregion 230433 .. 230722 

/note="AluSx" 
/ rp t_f ami ly= " S INE/ Alu " 
/rpt_type=DISPERSED 

repeat_region 230885 . .231744 

/note="LlPA16" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region 231745. .232185 

/note="LlPA16" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region complement (233642 . .233846) 

/hote="MIR" 

/rpt_f amily= ■■ SINE /MIR" 
/rpt_type=DISPERSED 
repeat_region complement (234030 . .234188) 

/note= fI MIR" 

/rpt_family= "SINE/MIR" 
/rpt_type=DISPERSED 
repeat_region complement (235129 . .235292) 

/note="LTR23" 

/rpt_f amily= " LTR/MER4 - group " 
/rpt_type=DISPERSED 

repeat_region complement (235293 . .235575) 

/note="AluSx" 
/ rpt_f ami ly= " S INE /Alu " 
/rpt_type=DISPERSED 

repeat_region complement (235576 . .235617) 

/note= ,t LTR23" 

/rpt^f ami ly= " LTR/MER4 - group " 
/rpt_type=DISPERSED 
repeat_region complement (235618 . .235925) 

/note="AluSg" 
/ rp t_f ami ly= " S INE / Alu " 



/rpt_type=DISPERSED 
repeat_region complement (235926 . .236188) 

/note="LTR23" 

/rpt_family= " LTR/MER4 -group " 
/rpt_type=DISPERSED 

repeat_region 236576 . .236642 

/note=" (TTTTC) n" 
/ rpt_f amily= " Simple_repeat " 
/ rpt_type=TANDEM 

repeat_region complement (236644 . . 236951) 

/note="AluY" 
/rpt_family=:"SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 239604. .239738 

/note="LTR47A" 

/rpt_f amily= " LTR/ Retroviral " 
/rpt_type=DISPERSED 

repeat_region complement (239739 . .239890) 

/note= " FLAM_C " 
/rpt_family="SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 239891. .240i56 

/ note= " LTR4 7A" 

/ rp t_f ami ly = " LTR/Re t rovi ral " 
/rpt_type=DISPERSED 

repeat__region 240273 . . 240301 

/note=" (TTTG)n" 
/ rp t_f ami 1 y = " S imp 1 e_r epea t " 
/rpt_type=TANDEM 

repeat_region complement (240307 . . 240584) 

/note="AluSx" 
/rpt_family="SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region complement (240682 . .240800) 

/note="MIR" 

/ rpt_f amily= " SINE /MIR" 
/rpt_type=DISPERSED 
repeat_region complement (242367 . .242423) 

/note="Alu" 

/rpt_f amily= " SINE/Alu " 
/rpt_type=DISPERSED 

repeat_region 242425 . . 242635 

/note="AluSg/x f ' 
/ rp t_f ami ly= "SINE / Alu " 
/rpt_type=DISPERSED 

repeat_region 243792 . .244003 

/note="LlME" 
/rpt_f amily= "LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (244757 . .244991) 

/note="AluJb n 
/ rp t_f ami ly = "SINE / Al u " 
/rpt_type=DISPERSED 

repeat_region 244994. .245575 

/ not e= " L1ME3A" 
/rpt_f amily= "LINE/LI " 
/rpt_type=DISPERSED 



repeat_region 245576 245881 

/note="AluJo M 
/ rpt_f ami ly= » S INE / Alu " 
/rpt_type=DISPERSED 

repeat_region 245882. .245960 

/ note= " L1ME3A" 
/rpt_f amily= » LINE/LI " 
/rpt_type=DISPERSED 

repeat_region 246003 . . 246717 

/note="LlME3A" 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region 246750 246919 

/not e= " L1ME3A" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat__region 246920. .246995 

/note=" FRAM/FAM" 
/rpt_f ami ly = "SINE /Alu " 
/rpt_type=DISPERSED 

repeat_region • 247035 . . 247344 

/ not e = " L 1ME 3 A " 
/rpt_f amily= "LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (247345 . .247423) 

/ note= "MADE1 11 
/rpt_f amily= 11 DNA/Mariner" 
/rpt_type=DISPERSED 

repeat_region 247424 .. 247561 

/ note= " L1ME3A" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region 247909. .248045 

/note="MSTA" 
/rpt_family= "LTR/MaLR" 
/rpt_type=DISPERSED 

repeat_region 248103 . . 248149 

/note="MSTA" 
/rpt_family= " LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region 249668 . . 250027 

/note="THElB" 
/rpt_f ami ly= " LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region complement (250448 . .250819) 

/ not e = " MLT2 FB " 

/rpt_f amily= 11 LTR/Retroviral " 
/rpt_type=DISPERSED 
repeat_region 251644 . .251799 

/note=" (TA) n" 

/ rpt_f amily= " Simple_repeat " 

/ rpt_type= TANDEM 
repeat_region 251918. .252016 

/note=" (TATATG) n" 

/ rpt_f ami ly= " S imple_repeat " 

/ rpt_type= TANDEM 
repeat_region 252054. .252503 



/note="LlMA3 " 

/rpt_f amily= "LINE/LI " 

/rpt_type=DISPERSED 

repeat_region 252504. .252549 

/ note= "MADE1 " 
/rpt_f amily= "DNA/ Mariner" 
/rpt_type=DISPERSED 

repeat_region 252550. .253015 

/note="LlMA3 " 
/ rp t _f ami ly= " L INE / LI " 
/rpt_type=DISPERSED 

repeat_region 253016 .. 253314 

/note="AluSq" 
/rpt_f amily= " SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 253315 . .253431 

/note="LlMA3 " 
/rpt_f amily= "LINE/LI » 
/rpt_type=DISPERSED 

repeat_region 253795. .253870 

/note="MLTlL" 
/ rp t_f ami ly = " LTR/MaLR » 
/rpt_type=DISPERSED 

repeat_region complement (254749 . .255256) 

/note="MLT2CB" 

/rpt_f amily= "LTR/Retroviral " 
/rpt_type=DISPERSED 

repeat_region complement (255877 . .256168) 

/note="AluJo" 
/rpt_family= "SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region complement (256689 . .256767) 

/note="AluJb" 
/rpt_family="SINE/Alu" 
/rpt__type=DISPERSED 

repeat_region 257615. .257663 

/note=" (TG) n" 

/rpt_family="Simple_repeat" 
/rpt_type=TANDEM 
repeat_region 257678. .257803 

/note="MER20" 

/rpt_family="DNA/MERl_type" 
/rpt_type=DISPERSED 
repeat_region 257864 . .258022 

/note="MIR" 

/rpt_family="SINE/MIR" 
/rpt_type=DISPERSED 

repeat_region 259701. .260270 

/note="LlPA13" 
/rpt_f amily= "LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (261342 . .261638) 

/note="AluSq" 
/ rp t_f ami ly= " S INE / Alu " 
/rpt_type=DISPERSED 

repeat_region 261745 . .262041 

/note="AiuSx" 



/rpt_f amily= " SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 263606. .263658 

/ not e = " MER4 6 A " 
/rpt_f ami ly= » DNA/MER2_type " 
/rpt_type=DISPERSED 

repeat_region complement (263659 . .263748) 

/note="LlPA9" 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region 263749. .263929 

/ not e= " MER4 6 A " 
/rpt_family="DNA/MER2_type" 
/rpt_type=DISPERSED 

repeat_region complement (263948 . .264193) 

/note="MIR w 

/rpt_f amily= "SINE/MIR " 
/rpt_type=DISPERSED 

repeat_region 264314 . . 264542 

/note="LlMA2 " 
/rpt_family= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region 264678. .265115 

/note="LTR3" 

/rpt_family= "LTR/Retroviral " 
/rpt_type=DISPERSED 
repeat_region complement (265834 . .265929) 

/note="MIR" 

/rpt_f amily= "SINE/MIR 11 
/rpt_type=DISPERSED 
repeat_region complement (265977 . .266095) 

/note="L2" 

/rpt_f amily="LINE/L2 " 
/rpt_type=DISPERSED 

repeat_region complement (266377 . .266741) 

/note="MLTlAl n 
/rpt_f amily= " LTR/MaLR" 
/ rpt_type=DIS PERSED 

repeat_region complement (267097 . .267399) 

/note="AluSq" 
/rpt_f amily= " SINE/Alu " 
/rpt_type=DISPERSED 

repeat_region 267891. .268019 

/note="LTR50" 
/ rp t_f ami ly= " LTR " 
/rpt_type=DISPERSED 

repeat_region 268067 . . 268247 

/ note= " LTR3 3 A" 

/ rp t _f ami 1 y = " LTR /Ret r o v i r a 1 " 
/rpt_type=DISPERSED 

repeat_region 269782. .270098 

/note="AluSp" 
/ rpt_f ami ly= " S INE / Alu » 
/rpt_type=DISPERSED 

repeat_region complement (271014 . .271329) 

/note="AluJo" 
/ rpt_f ami 1 y= " S INE /Alu " 



/rpt_type=DISPERSED 

repeat_region 272695 .. 272808 

/note="AluJo" 
/rpt_f amily= " SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 272809. .272840 

/note=" (TAA)n" 
/rpt_f amily= "Simplejrepeat " 
/ rpt_type=TANDEM 

repeat_region 272841. .273006 

/note="AluJo" 
/rpt_family=»SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region complement (274131 . . 274200) 

/note="MIR n 

/rpt__family="SINE/MIR" 
/rpt_type=DISPERSED 

repeat_region 275496. .275790 

/note="AluSx" 
/rpt_f ami ly= " S INE/Alu " 
/ rpt_type=DISPERSED 

repeat_reigion 276789 277342 

/note= f, LlPA7 n 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region complement (277965 . .278057) 

/note="Ll" 

/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (278058 . .2782 85) 

/note="AluSc" 
/ rpt _f ami ly = " S INE / Alu " 
/rpt_type=DISPERSED 

repeat_region complement (2 78286 . .278384) 

/note="Ll" 

/rpt_f amily= "LINE/LI " 
/ rpt_t ype=DI SPERSED 

repeat^region complement (279899 . .280200) 

/note= fl AluSq" 
/rpt_f amily= " SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 281586. .281629 

/note= n (CTGGGG) n" 
/rpt_f amily= " Simple_repeat " 
/ rpt_type= TANDEM 

repeat_region 284328 . , 284751 

/ not e= " MLT1D " 
/ rp t_f ami ly = « LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region 285631. .285802 

/note="MIR" 

/rpt_family="SINE/MIR" 
/rpt_type=DISPERSED 
repeat_region 286153 . .286235 

/not e= "MIR" 

/rpt_family= " SINE/MIR" 
/rpt_type=D!SPERSED 



repeat_region 287050. .287194 

/note="MIR" 

/ rp t_f ami ly = "SI NE /MIR" 
/rpt_type=DISPERSED 

repeat_region 287532 .. 287722 

/note="MLTH" 
/rpt_f amily= " LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region 291286. .291551 

/note="AluSp" 
/rpt_f amily= " SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 292485. .292586 

/note="L2" 

/rpt_f amily= " LINE/L2 " 
/rpt_type=DISPERSED 
repeat_region 2 92 637.. 293015 

/note="L2" 

/rpt_family="LINE/L2" 
/rpt_type=DISPERSED 

repeat_region complement (293025 . .293207) 

/note="LlMEc" 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (2932 08 . .293339) 

/note="LlPBl» 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region complement (293340 . .293655) 

/note="LlMEc" 
/rpt_family=: "LINE/LI" 
/rpt_type=DISPERSED 

repeat_region complement (293772 . .294060) 

/hote="AluSq" 
/rpt_family="SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 2 94612 294770 

/not e= "MIR" 

/rpt_family="SINE/MIR" 

/rpt_type=DISPERSED 
repeat_region 294938 .. 294974 . 

/note=" (TATG)n" 

/rpt_f amily= 11 Simple_repeat " 

/ rpt_type= TANDEM 
repeat_region 295016. .295118 

/note="MIR" 

/rpt_family=" SINE/MIR" 
/rpt_type=DISPERSED 

repeat_region 295484 . .295614 

/note="MLTUl" 
/rpt_f amily= "LTR/MaLR" 
/rpt_type=DISPERSED 

repeat_region 296092. .296159 

/note=" (TG) n" 

/rpt_f amily= "Simple_repeat " 
/rpt_type=TANDEM 
repeat_region complement (2 98406 . .2 98970) 



/note= "MLT2D " 

/ r p t_f am i 1 y = " LTR / Re t r ov i r a 1 " 
/ rpt_type=DIS PERSED 

repeat_region complement (300518 . .301093) 

/note="LlMEc n 
/rpt_f amily= "LINE/LI " 
/rpt_type=DISPERSED 

repeat_region complement (301314 . .301429) 

/note-"LlMEc" 
/rpt_f amily= ■ LINE/LI " 
/rpt_type=DISPERSED 

repeat_region 302208. .303140 

/note= ,f LlMC3" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region complement (303275 . .303473) 

/note="MIR" 

/rpt_family= "SINE/MIR" 
/rpt_type=DISPERSED 

repeat_region complement (3 03497 . .303797) 

/note="AluY n 
/rpt_£amily« "SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 304920. .305274 

/note="AluJo" 
/ rp t _f ami ly= " SINE / Alu " 
/rpt_type=DISPERSED 

repeat_region 306916. .307223 

/note="AluSq H 
/rpt_f amily= "SINE/Alu" 
/rpt_type=DISPERSED 

repeat_region 308362. .308420 

/note="L2" 

/rpt_family="LINE/L2" 
/rpt_type=DISPERSED 

repeat_region 311366 311738 

/note="MSTD" 
/rpt_f amily= " LTR/MaLR " 
/rpt_type=DISPERSED 

repeat_region 312414 .. 312485 

/note="L2" 

/ rp t_f ami ly= " LINE / L2 " 
/rpt_type=DISPERSED 

repeat_region complement (312709 . .314432) 

/note="LlMCb" 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

repeat_region 315703. .316838 

/note="LlME3 n 
/rpt_f amily= " LINE/LI " 
/rpt_type=DISPERSED 

STS 316289. .316432 

/s tandard_name= " D2 1S2 03 5 " 
/note= "Accession No. G15170 
SHGC-15782" 

repeat_region 317444 .. 317519 

/note="L2" 



/rpt_f amily= " LINE/L2 " 
/rpt_type=DISPERSED 
STS 318252 . .318359 

/ st andard_name= "AO 0 8F2 9 " 
/note= "Accession No. R66691 
stSG10253" 

/db_xref="GDB: 9882770" 
STS 318260.. .318394 

/ s t andar d_name ="sts-R67568" 
repeat_region complement (32 0906 . .321055) 

/note="MIR" 

/rpt_family= "SINE/MIR" 
/rpt_type=DISPERSED 
repeat_region 321167 .. 321255 

/not e= "MIR" 

/rpt_family=" SINE/MIR" 
/rpt_type=DISPERSED 
repeat_region 321719. .321745 

/note=" (T)n" 

/rpt_family="Simple_repeat" 
/ rpt_type=TANDEM 
repeat_region complement (321870 . .322008) 

/note="L2" 

/ rpt_f ami ly= " LINE/L2 " 
/rpt_type=DISPERSED 
repeat_region complement (322685 . .323347) 

/note="Ll" 

/ rp t_f ami 1 y = " L I NE / L 1 " 
/rpt_type=DISPERSED 

repeat^region complement (323404 . .324689) 

/note="LlMl" 
/rpt_family="LINE/Ll" 
/rpt_type=DISPERSED 

repeat_region complement (324690 . .325208) 

/note="MLT2CA" 

/rpt_f ami ly= " LTR /Retroviral " 
/rpt__type=DISPERSED 

repeat_region complement (325209 . . 325655) 

/note="LlMl" 
/rpt_family="LINE/Ll n 
/rpt_type=DISPERSED 

repeat_region 325739 . . 326287. 

/ note= "HERVL4 0 " 
/rpt_family="LTR/Retroviral" 
/ rpt_type=DI S PERSED 

repeat_region 326288 . . 326625 

/note="MER2 " 

/rpt_f amily= "DNA/MER2_type " 
/rpt_type=DISPERSED 

repeat_region 326626 . . 326725 

/note= "HERVL40" 
/ r Pt_f ami ly = " LTR/Re t r ovi r al " 
/rpt_type=DISPERSED 

repeat_region 326739 . . 326833 

/note="HE R VL 40" 
/rpt_family="LTR/Retroviral" 
/rpt_type=DISPERSED 



repeat_region 32 6996. .32742 9 

/note= M LTR40b" 

/rpt_f amily="LTR/Retroviral " 
/rpt_type=DISPERSED 

repeat_region 327971 .. 328087 

/note=" (TATATG) n" 
/rpt_f amily= " Simple_repeat " 
/ rpt_type=TANDEM 

repeat_region 331379. .331439 

/note="MIR" 

/rpt_f ami ly= " SINE/MIR " 
/rpt_type=DISPERSED 
repeat_region 331577 .. 331711 

/note="MIR" 

/rpt_family=" SINE/MIR" 
/rpt_type=DISPERSED 
repeat_region complement (332106 . .332264) 

/note="MIR» 

/ rpt_f amily= "SINE/MIR" 
/ rpt_t ype=DI SPERSED 

repeat_region 332265 . .332576 

/note="AluSx" 
/rpt_f amily= " SINE/Alu 11 
/rpt_type=DISPERSED 

repeat_region complement (332577 . . 332631) 

/note="MIR" 

/rpt_f amily= "SINE/MIR " 

/rpt_type=DISPERSED . 
STS 333468. .333657 

/standard_name="D21S1270" 

/note= "Accession No. G08079 

CHLC . GATA7G0 9 " 
repeat_region 333513 .. 333586 

/note=" (TAGA) n M 

/rpt_f amily= "Simple_repeat " 

/ r p t _t yp e = TAND EM 
repeat_region 333613 333682 

/note=" (TG)n n 

/ rpt__f ami ly = " S impl e_r epea t " 
/rpt_type=TANDEM 
repeat_region complement (33 3 703 . .334117) 

/ no t e = " MLT2 CB " 

/rpt_family="LTR/Retroviral" 
/rpt_type=DISPERSED 
repeat_region complement (334397 . .334631) 

/note="MIR M 

/ rpt_f amily= "SINE/MIR" 
/rpt_type=DISPERSED 

repeat_region 334655 .. 334951 

/note="AluY" 
/ rp t_f ami ly = " S INE / Alu " 
/rpt_type=DISPERSED 

repeat_region 338183. .338215 

/note=" (TTTC)n" 
/rpt_f amily= 11 Simp le_r epea t " 
/rpt_type= TANDEM 

repeat_region complement (33 8216 . .338525) 



/note="AluSq" 
/ rp t_f ami ly= » S I NE/Alu 11 
/rpt_type=DISPERSED 
repeat_region 338527 . . 338970 

/note="L2» 

/rpt_f amily= « LINE/L2 « 
/rpt_type=DISPERSED 
repeat_region complement (33 9226 . .339529) 

/note="AluSx" 
/rpt_f amily= " SINE/Alu" 
/rpt_type=DISPERSED 



BASE COUNT 
ORIGIN 



ACCESSION 
KEYWORDS 

SOURCE 

ORGANISM 
REFERENCE 

AUTHORS 

TITLE 



JOURNAL 



COMMENT 



2005 100 .0.0 

P__AAH64874 Human secreted protein cDNA, SEQ ID NO: 150. 045 bp, 
cDNA, PAT ll-SEP-2001 
P_AAH64874 

GENESEQ; Human; secreted protein; gene therapy; vaccine; treatment; 
diagnosis; GENSET; patent; patentdb (v200420, 23-SEP-2004) . 
Homo sapiens. 
Homo sapiens. 
1 (bases 1 to 2045) 

Dumas Milne Edwards , J . , Bougueleret , L . , Jobert , S . 

Full length GENSET human nucleic acids encoding potentially secreted 
proteins, useful in gene therapy and vaccination against a variety 
of diseases, and for diagnosis of those diseases. 

Patent: WO200142451-A2 ; Filing Date: 07-DEC-2000; 2000WO-IB001938 ; 
Publication Date: 14-JUN-2001; Priority: 08-DEC-1999; 
99US-0169629P. 06-MAR-2000; 2000US- 0187470P ; Assignee: (GEST ) 
GENSET; Cross Reference: WPI; 2001-367870/38. P-PSDB; AAG89271; 
Patent Format: Claim 7; Page 713-714; 921pp; English. 

The invention relates to full length GENSET human nucleic acids 
encoding potentially secreted proteins. The nucleic acids and the 
polypeptides they encode may be used in the prevention, treatment 
and diagnosis of diseases associated with inappropriate GENSET gene 
expression. For example, they be used to treat disorders associated 
with decreased GENSET gene expression by rectifying mutations or 
deletions in a patient's genome that affect the activity of GENSET 
or by supplementing the. patients own production of GENSET 
polypeptides. Conversely, antisense nucleic acid molecules may be 
administered to down regulate GENSET expression by binding with the 
cells 1 own genes and preventing their expression. The sense and 
antisense nucleic acids may also be used as DNA probes in diagnostic 
assays to detect and quantitate the presence of similar nucleic 
acid sequences in samples, and hence to determine which patients 
may be in need of restorative therapy. The GENSET polypeptides may 
be used as antigens in the production of antibodies and in assays to 
identify modulators (agonists and antagonists) of GENSET polypeptide 
expression and activity. The present sequence is a GENSET nucleic 
acid of the invention 
FEATURES Location/Qualifiers 
BASE COUNT 620 a 352 c 398 g 675 t 

ORIGIN 



1930 100 0.0 

HSA2507il Homo sapiens CLDN8 gene for claudin-8. 1931 bp, 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



gene 



CDS 



polyA_signal 
BASE COUNT 
ORIGIN 



DNA, linear, PRI ll-NOV-1999 
AJ250711 

AJ250711 .1 GI : 6433859 
claudiri-8; CLDN8 gene. 
Homo sapiens (human) 
Homo sapiens 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 

Keen, T.J. and Inglehearn, C. F. 

Unpublished 

2 (bases 1 to 1931) 

Keen, T.J. 

Direct Submission 

Submitted (08-NOV-1999) Keen T. J. , Molecular Medicine Unit, 
University of Leeds, Clinical Sciences Building, St James's 
Hospital, Leeds, LS9 7TF, UNITED KINGDOM 
Location/Qualifiers 
1...1931 
/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 9606" 
/chromosome= "21" 
/map="21q22.1" 

23. .700 
/gene="CLDN8" 

23.. 700 
/gene="CLDN8 n 
/ codon_s t ar t = 1 
/product= " claudin- 8 " 
/protein_id="CAB60615 . 1" 
/db_xref="GI: 6433860" 
/ db__x ref="GOA:P56748" 
/db_xref = "Swiss-Prot : P56748 " 
1913 . .1918 



1882 100 0.0 

P_AAS00668 Human INTERCEPT 309 cDNA. 909 bp, cDNA, PAT 07-SEP-2001 
ACCESSION P_AAS00668 

KEYWORDS GENES EQ; Human; INTERCEPT 309; T cell; heart; liver; pancreas; 

placenta; brain; skeletal muscle; kidney; spleen; lymph node; 
peripheral blood leukocyte; bone marrow; thymus tissue; abnormal 
blood coagulation; asthma; cancer; anaphylaxis; hepatitis; multiple 
sclerosis; coronary artery disease,- ss; malaria; atopic dermatitis; 
amyotrophic lateral sclerosis; meningitis; attention deficit 
disorder; Crohn's disease; gastroenteritis; goitre; hypoglycaemia; 
diabetes mellitus; endometriosis; pulmonary embolism; muscular 
dystrophy; immuno- competence ; vertebrate; blood; serum; lung; 
patent; patentdb (v200420, 23-SEP-2004) . 
SOURCE Homo sapiens. 

ORGANISM Homo sapiens. 

1 (bases 1 to 1909) 

Mackay,C.R., Myers, P. S., Kirst,S.J., Fraser,C.C, Leiby,K.R. 
New isolated nucleic acid molecule for diagnosis, prevention, and 
therapy of human and other animal disorder, or as modulating agent 



REFERENCE 
AUTHORS 
TITLE 



for regulating cellular processes. 
JOURNAL Patent: WO200129088-A1 ; Filing Date: 23-JUN-2000; 2000WO-US017386 ; 
Publication Date: 26-APR-2001; Priority: 19-OCT-1999; 
99US-00420707; Assignee: (MILL-) MILLENNIUM PHARM INC; Cross 
Reference: WPI; 2001-308477/32. P-PSDB; AAU00678; Patent Format: 
Claim 1; Fig 3A-3C; 263pp; English. 

COMMENT The sequence represents a cDNA which encodes human INTERCEPT 3 09 

polypeptide. This protein and similar others exhibit the ability to 
affect growth, proliferation, survival, differentiation, activity, 
morphology, or movement /migration of , e.g. T cells and cells of the 
heart, liver, pancreas, placenta, brain, lung, skeletal muscle, 
kidney, spleen, lymph node, peripheral blood leukocyte, bone marrow 
or thymus tissue. They can be used as modulating agents for 
regulating cellular processes, thus, the proteins and their 
associated nucleic acids can be used to prognosticate, prevent, 
diagnose, or treat disorders associated with physiological 
processes. These disorders include abnormal blood coagulation, 
asthma, anaphylaxis, hepatitis, multiple sclerosis, cancer, 
coronary artery disease, malaria, atopic dermatitis, amyotrophic 
lateral sclerosis, meningitis, attention deficit disorder, Crohn's 
disease, gastroenteritis, goitre, hypoglycaemia, diabetes mellitus, 
endometriosis, pulmonary embolism and muscular dystrophy. Antibodies 
to disorders such as these can be made by providing a polypeptide 
of the invention to an immuno- competent vertebrate and harvesting 
blood or serum from the vertebrate 

FEATURES Location/Qualifiers 
CDS 2.. 649 

/*tag= a 

/product= "Human INTERCEPT 309" 
si 9_JP e Ptide 2.. 73 

/*tag= b 
mat_j>eptide 74.. 646 

/*tag= c 

/product= "Mature human INTERCEPT 309" 
BASE COUNT 586 a 318 c 368 g 637 t 

ORIGIN 

1832 100 0.0 

P_AAH64876 Human secreted protein cDNA, SEQ ID NO: 152. 931 bp, 

cDNA, PAT ll-SEP-2001 
ACCESSION P_AAH64876 

KEYWORDS GENESEQ; Human; secreted protein; gene therapy; vaccine; treatment; 

diagnosis; GENSET; patent; patentdb (v200420, 23-SEP-2004) . 
SOURCE Homo sapiens. 

ORGANISM Homo sapiens. 
REFERENCE 1 (bases 1 to 1931) 

AUTHORS Dumas Milne Edwards , J. , Bougueleret , L . , Jobert , S . 

TITLE Full length GENSET human nucleic acids encoding potentially secreted 

proteins, useful in gene therapy and vaccination against a variety 

of diseases, and for diagnosis of those diseases. 
JOURNAL Patent: WO200142451-A2 ; Filing Date: 07-DEC-2000; 2000WO-IB001938 ; 

Publication Date: 14-JUN-2001; Priority: 08-DEC-1999; 

99US-0169629P. 06-MAR-2000; 2000US-0187470P; Assignee: (GEST ) 

GENSET; Cross Reference: WPI; 2001-367870/38. P-PSDB; AAG89273; 

Patent Format: Claim 7; Page 715-716; 921pp; English. 
COMMENT The invention relates to full length GENSET human nucleic acids 

encoding potentially secreted proteins. The nucleic acids and the 



polypeptides they encode may be used in the prevention, treatment 
and diagnosis of diseases associated with inappropriate GENSET gene 
expression. For example, they be used to treat disorders associated 
with decreased GENSET gene expression by rectifying mutations or 
deletions in a patient's genome that affect the activity of GENSET 
or by supplementing the patients own production of GENSET 
polypeptides. Conversely, antisense nucleic acid molecules may be 
administered to down regulate GENSET expression by binding with the 
cells' own genes and preventing their expression. The sense and 
antisense nucleic acids may also be used as DNA probes in diagnostic 
assays to detect and quant itate the presence of similar nucleic 
acid sequences in samples, and hence to determine which patients 
may be in need of restorative therapy. The GENSET polypeptides may 
be used as antigens in the production of antibodies and in assays to 
identify modulators (agonists and antagonists) of GENSET polypeptide 
expression and activity. The present sequence is a GENSET nucleic 
acid of the invention 

FEATURES Location/Qualifiers 

BASE COUNT 587 a 343 c 393 g 608 t 

ORIGIN 

1831 99 0.0 

P_AAH15597 Human cDNA sequence SEQ ID NO: 13909. 890 bp, cDNA, PAT 26-JUN-2001 
ACCESSION P_AAH15597 

KEYWORDS GENE SEQ; Human; primer; detection; diagnosis; antisense therapy; 

gene therapy; patent; patentdb (v200420/ 23-SEP-2004) . 
SOURCE Homo sapiens. 

ORGANISM Homo sapiens. 
REFERENCE 1 (bases 1 to 1890) 

AUTHORS Ota,T., Isogai,T., Nishikawa, T. , Hayashi,K., Saito,K., 
Yamamoto,J. Ishii,S., Sugiyama,T., Wakamatsu,A. , Nagai,K., 
Otsuki,T. 

TITLE Primer sets for synthesizing polynucleotides, particularly the 5602 

full- length cDNAs defined in the specification, and for the 
detection and/or diagnosis of the abnormality of the proteins 
encoded by the full-length cDNAs. 

JOURNAL Patent: EP1074617-A2; Filing Date: 28- JUL-2000 ; 2000EP-00116126 ; 
Publication Date: 07-FEB-2001; Priority: 29-JUL-1999; 
99JP-00248036. 27-AUG-1999; 99JP-00300253 . ll-JAN-2000; 
2000JP-00118776. 02 -MAY-2000 ;. 2000JP-00183767 . 09-JUN-2000; 
2000JP-00241899; Assignee: (HELI-) HELIX RES INST; Cross Reference: 
WPI; 2001-318749/34; Patent Format: Claim 8; SEQ ID NO 13909; 2537pp 
Sequence Listing; English. 
COMMENT The present invention describes primer sets for synthesising 5602 

full- length cDNAs defined in the specification. Where a primer set 
comprises: (a) an oligo-dT primer and an oligonucleotide 
complementary to the complementary strand of a polynucleotide which 
comprises one of the 5602 nucleotide sequences defined in the 
specification, where the oligonucleotide comprises at least 15 
nucleotides; or (b) a combination of an oligonucleotide comprising 
a sequence complementary to the complementary strand of a 
polynucleotide which comprises a 5 '-end sequence and an 
oligonucleotide comprising a sequence complementary to a 
polynucleotide which comprises a 3 '-end sequence, where the 
oligonucleotide comprises at least 15 nucleotides and the 
combination of the 5 '-end sequence/3 ' -end sequence is selected from 
those defined in the specification. The primer sets can be used in 



antisense therapy and in gene therapy. The primers are useful for 
synthesising polynucleotides, particularly full-length cDNAs. The 
primers are also useful for the detection and/or diagnosis of the 
abnormality of the proteins encoded by the full-length cDNAs. The 
primers allow obtaining of the full-length cDNAs easily without any 
specialised methods . AAH03166 to AAH13628 and AAH13633 to AAH18742 
represent human cDNA sequences; AAB92446 to AAB95893 represent human 
amino acid sequences; and AAH13629 to AAH13632 represent 
oligonucleotides, all of which are used in the exemplification of 
the present invention 

FEATURES Location/Qualifiers 

BASE COUNT 565 a 341 c 384 g 600 t 

ORIGIN 

1831 99 0.0 

AK022269 Homo sapiens cDNA FLJ12207 fis, clone MAMMA1000956 , highly similar 
to Homo sapiens CLDN8 gene for claudin-8. 1890 bp, 
mRNA, linear, PRI 30-JAN-2004 

ACCESSION AK022269 

VERSION AK022269.1 GI: 10433628 

KEYWORDS oligo capping; fis (full insert sequence) . 
SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Ota, T. , Suzuki, Y. , Nishikawa, T. , Otsuki,T., Sugiyama,T., Irie,R., 
Wakamatsu, A. , Hayashi,K., Sato,H., Nagai,K., Kimura,K., Makita,H., 
Sekine , M . , Obayashi , M . , Nishi , T . , Shibahara, T . , Tanaka, T . , 
Ishii,S., Yamamoto,J., Saito,K., Kawai,Y., Isono,Y., Nakamura,Y., 
Nagahari,K., Murakami, K. , Yasuda,T., Iwayanagi , T . , Wagatsuma, M . , 
Shiratori,A. , Sudo,H., Hosoiri,T., Kaku,Y., Kodaira,H., Kondo,H. # 
Sugawara , M . , Takahashi , M . , Kanda , K . , Yokoi , T . , Furuya , T . , 
Kikkawa,E., Omura,Y., Abe,K., Kamihara,K., Katsuta,N. , Sato,K., 
Tanikawa,M., Yamazaki,M., Ninomiya,K., Ishibashi , T. , Yamashita, H . , 
Murakawa,K., Fujimori, K. , Tanai,H., Kimata,M., Watanabe,M., 
Hiraoka,S., Chiba,Y., Ishida,S., Ono,Y. , Takiguchi , S . , Watanabe,S., 
Yosida,M., Hotuta,T., Kusano,J., Kanehori,K., Takahashi -Fuj ii , A. , 
Hara,H., Tanase,T., Nomura, Y. , Togiya,S., Komai,F., Hara,R., 
Takeuchi,K., Arita,M., Imose,N., Musashino, K. , Yuuki,H., Oshima,A., 
Sasaki,N., Aotsuka,S., Yoshikawa, Y. , Matsunawa, H . , Ichihara,T., 
Shiohata,N., Sano,S. , Moriya,S. # Momiyama,H., Satoh,N., Takami,S., 
Terashima, Y. , Suzuki, O., Nakagawa,S., Senoh,A. , Mizoguchi,H. , 
Goto,Y., Shimizu,F., Wakebe,H., Hishigaki , H. , Watanabe,T., 
Sugiyama , A . , Takemoto , M . , Kawakami , B . , Yamazaki , M . , Watanabe , K. , 
Kumagai , A . , I takura , S . , Fukuzumi , Y . , Fuj imori , Y . , Komiyama , M . , 
Tashiro,H., Tanigami,A., Fujiwara,T., Ono,T., Yamada,K., Fujii,Y., 
Ozaki,K., Hirao,M., Ohmori,Y., Kawabata, A. , Hikij i, T. , Kobatake,N., 
Inagaki,H., Ikema,Y., Okamoto,S., Okitani,R., Kawakami, T., 
Noguchi,S., Itoh,T., Shigeta,K., Senba,T., Matsumura, K. , 
Nakajima,Y., Mizuno,T., Morinaga,M. , Sasaki, M. , Togashi,T., 
Oyama,M., Hata,H., Watanabe,M. , Komatsu,T., Mizushima-Sugano, J. , 
Satoh,T., Shirai,Y., Takahashi , Y . , Nakagawa,K., Okumura,K., 
Nagase , T . , Nomura , N . , Kikuchi , H . , Masuho , Y . , Yamashita , R . , 
Nakai,K., Yada,T., Nakamura,Y., Ohara,0., Isogai,T. andSugano,S. 

TITLE Complete sequencing and characterization of 21,243 full-length 
human cDNAs 



JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



Nat . Genet 

14702039 

2 

Isogai, T. , 
Nishikawa,T. 
Wagatsuma,M. 
Sugawara,M. , 



36 (1) , 40-45 (2004) 



FEATURES 

source 



Ota,T., Hayashi,K., Sugiyama,T., Otsuki,T., Suzuki, Y. , 
Nagai,K., Sato,H., Sugano,S., Shiratori, A. , Sudo,H., 
Hosoiri,T., Kaku,Y., Kodaira,H., Kondo,H., 
Takahashi , M . , Chiba,Y., Ishida,S., Murakawa,K., 
Ono , Y . , Takiguchi , S . , Wat anabe , S . , Kimura , K. , Murakami , K . , 
Ishii,S., Kawai,Y. # Saito,K., Yamamoto , J . , Wakamatsu, A. , 
Nakamura , Y . , Nagahar i , K . , Masuho , Y . , Ninomiya , K . and I wayanagi , T . 
NEDO human cDNA sequencing project 
Unpublished 
3 (bases 1 to 1890) 
Isogai,T. and Otsuki,T. 
Direct Submission 

Submitted (23 -AUG-2000) Takao Isogai, Helix Research Institute, 
Genomics Laboratory; 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan 
(E-mail: genomics@hri.co.jp, Tel : 81-43 8-52-3975 , Fax: 81-43 8-52-3 986) 

NEDO human cDNA sequencing project supported by Ministry of 
International Trade and Industry of Japan; cDNA full insert 
sequencing: Research Association for Biotechnology; cDNA library 
construction, 5 f - & 3 ' -end one pass sequencing and clone selection: 
Helix Research Institute (supported by Japan Key Technology Center 
etc.) and Department of Virology, Institute of Medical Science, 
University of Tokyo. 

Location/Qualifiers 
1. .1890 
/organism="Homo sapiens" 
/mol_type= "mRNA" 
/db_xref = " taxon : 9606 " 
/clone="MAMMA1000956" 
/tissue_type="Mammary gland" 
/clone_l ib= "MAMMA1 " 
/note=" cloning vector: pME18SFL3" 
795^796 

/note="5 bases segment is present in AJ250711 , AP001707.1 
and AP000884.1." 



misc feature 



BASE COUNT 
ORIGIN 



164. 793 bp, 



1692 100 0.0 

P__AAH64888 Human secreted protein cDNA, SEQ ID NO: 
CDNA, PAT ll-SEP-2001 
P_AAH64888 

GENES EQ ; Human; secreted protein; gene therapy; vaccine; treatment; 
diagnosis; GENSET; patent; patentdb (v200420, 23-SEP-2004) . 
Homo sapiens. 
Homo sapiens. 
1 (bases 1 to 1793) 

Dumas Milne Edwards, J., Bougueleret ,L. , Jobert,S. 

Full length GENSET human nucleic acids encoding potentially secreted 
proteins, useful in gene therapy and vaccination against a variety 
of diseases, and for diagnosis of those diseases. 

Patent: WO200142451-A2; Filing Date: 07-DEC-2000; 2000WO- IB001938 ; 
Publication Date: 14-JUN-2001; Priority: 08-DEC-1999; 
99US-0169629P. 06-MAR-2000; 2000US-0187470P; Assignee: (GEST ) 
GENSET; Cross Reference: WPI; 2001-367870/38. P-PSDB; AAG89285; 



ACCESSION 
KEYWORDS 

SOURCE 

ORGANISM 
REFERENCE 

AUTHORS 

TITLE 



JOURNAL 



Patent Format: Claim 7; Page 726-727; 921pp; English. 

COMMENT The invention relates to full length GENSET human nucleic acids 

encoding potentially secreted, proteins . The nucleic acids and the 
polypeptides they encode may be used in the prevention, treatment 
and diagnosis of diseases associated with inappropriate GENSET gene 
expression. For example, they be used to treat disorders associated 
with decreased GENSET gene expression by rectifying mutations or 
deletions in a patient's genome that affect the activity of GENSET 
or by supplementing the patients own production of GENSET 
polypeptides. Conversely, antisense nucleic acid molecules may be 
administered to down regulate GENSET expression by binding with the 
cells' own genes and preventing their expression. The sense and 
antisense nucleic acids may also be used as DNA probes in diagnostic 
assays to detect and quantitate the presence of similar nucleic 
acid sequences in samples, and hence to determine which patients 
may be in need of restorative therapy. The GENSET polypeptides may 
be used as antigens in the production of antibodies and in assays to 
identify modulators (agonists and antagonists) of GENSET polypeptide 
expression and activity. The present sequence is a GENSET nucleic 
acid of the invention 

FEATURES Location/Qualifiers 

BASE COUNT 544 a 318 c 369 g 556 t 

ORIGIN 

1689 99 0.0 

P_AAH64890 Human secreted protein cDNA, SEQ ID NO: 166. 748 bp, 

CDNA, PAT ll-SEP-2001 
ACCESSION P_AAH64890 

KEYWORDS GENES EQ ; Human; secreted protein; gene therapy; vaccine; treatment; 

diagnosis; GENSET; patent; patentdb (v200420, 23-SEP-2004) . 
SOURCE Homo sapiens. 

ORGANISM Homo sapiens. 
REFERENCE 1 (bases 1 to 1748) 

AUTHORS Dumas Milne Edwards , J. , Bougueleret , L . , Jobert , S . 

TITLE Full length GENSET human nucleic acids encoding potentially secreted 

proteins, useful in gene therapy and vaccination against a variety 
of diseases, and for diagnosis of those diseases. 

JOURNAL Patent: WO200142451-A2 ; Filing Date: 07-DEC-2000; 2000WO-IB001938 ; 
Publication Date: 14-JUN-2001; Priority: 08-DEC-1999 ; 
99US-0169629P. 06-MAR-2000; 2000US-0187470P; Assignee: (GEST ) 
GENSET; Cross Reference: WPI; 2001-367870/38. P-PSDB; AAG89287; 
Patent Format: Claim .7; Page 729; 921pp; English. 
COMMENT The invention relates to full length GENSET human nucleic acids 

encoding potentially secreted proteins. The nucleic acids and the 
polypeptides they encode may be used in the prevention, treatment 
and diagnosis of diseases associated with inappropriate GENSET gene 
expression. For example, they be used to treat disorders associated 
with decreased GENSET gene expression by rectifying mutations or 
deletions in a patient's genome that affect the activity of GENSET 
or by supplementing the patients own production of GENSET 
polypeptides. Conversely, antisense nucleic acid molecules may be 
administered to down regulate GENSET expression by binding with the 
cells 1 own genes and preventing their expression. The sense and 
antisense nucleic acids may also be used as DNA probes in diagnostic 
assays to detect and quantitate the presence of similar nucleic 
acid sequences in samples, and hence to determine which patients 
may be in need of restorative therapy. The GENSET polypeptides may 



be used as antigens in the production of antibodies and in assays to 
identify modulators (agonists and antagonists) of GENSET polypeptide 
expression and activity. The present sequence is a GENSET nucleic 
acid of the invention 
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Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
Non-invasive and Invasive Human Transitional 
Cell Carcinomas* 

Tortoen F. 0rntoft|:§, Thomas ThykjaeiH, Frederic M. Waldmanll, Hans Wolf**, 
and Julio E. Cells** 

Gain and loss of chromosomal material is characteristic phenomenon at both the transcription and translation levels 

of bladder cancer, as well as malignant transformation In High throughput array studies of the breast cancer cell line 

ZlTlSSr^^ 63 * b °J? the BT474 tes ^ggested that there is a correlation between 

tempted to address thls fluesUon In pate of noninvasive *T ?* ^ StU ^ S °L^ du ? flenes in solid tumore 

and invasive human bladder tumore using a combination revealed a 90od correlation between gene dose and 

of technology that included comparative genomic hybrid- m NA or prote,n ,evels in the case of c -«*-B2. cyclln d1 , 

featlon, high density oligonucleotide array-based monitor- ems1 ' and N - m y c C 3-5 )- However, a high cyclin D1 protein 

Ing of transcript levels (5600 genes), and high resolution expression has been observed without simultaneous am- 

two-dimensional gel electrophoresis/the results showed ~J J,lflcation < 4 )- a low level of c-myc copy number in- 

that there is a gene dosage effect that in some cases crease was observed without concomitant c-myc protein 

superimposes on other regulatory mechanisms. This ef- overexpresslon (6). . 

feet depended (p < 0.015) on the magnitude of the com- In human bladder tumors, karyotyping, fluorescent In sftu 

showed either reduced or unaltered transcript level^Be- 0336 «* non ' lnvasive pTa *»nsKtonal <»" carcinomas (TOCs). 
cause most proteins resolved by two-dimensional gels ,ndudes 1088 01 chromosome 9 or parts of it. as well as 
are unknown It was only possible to compare mRNA and 1088 Y ,n ma,es - ln minimally Invasive pTI TCCs, the fol- 
proteln alterations in relatively few cases of well focused low,n 9 alterations have been reported: 2q-, 11p-, 1q+, 
abundant proteins. &[»h few exceptions we found a good 1 1q1 3+. 1 7q+. and 20q+ (7-12). It has been suggested that 
correlation (p < 0.005) between transcript alterations and these regions harbor tumor suppressor genes and onco- 
protein levels. The Implications, as well as limitations, genes; however, the large chromosomal areas Involved often 
of the approach are discussed. Molecular & Celfuter contain many genes, making meaningful predictions of the 
Proteomlcs 1:37-45, 2002. functional consequences of losses arid gains very difficult. 

In this investigation we have combined genome-wide tech- 

(1), but little is kn own about the genome-wide effect of this teomlcs) to determine the effect of gene copy number on 

~— — — — — — transcript and protein levels In pairs of non-Invasive and Irv 

Rom the ^Department of Cfinical Biochemistry, Molecular Dlag- vaslve human bladder TCP* 
noetic Laboratory and -Department of Urology. Aarhus University 

Hospital, Skejby, DK-8200 Aarhus N, Denmark, IAROS Applied Bio- EXPEWMEMTAL PROCEDURES 

technology ApS, Gustav WledsveJ 10 f DK-8000 Aarhus ST Denmark, e^MErJTAL PROCEDURES 

IUCSF Cancer Center and Department of Laboratory Medicine, Uni- Mater/a/- Bladder tumor biopsies were sampled after Informed 

versity of California, San Francisco, CA 941 43-0808, and institute consent obtained and after removal of tissue for routine paihol- 
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J^tl^t^S nu ^ r md level Shown from teft to right are chromosome (Cnr.), GGH profiles, gene location and 

ZSSftJ ^J^^^^^ **** the chro ™^' A > expression of mRNMn 

°n4nvaslve counterpart tumor 335. e, expression of mRNA In Invasive tumor 827 compared with the r*>rHhvasrve 

^^^^L^^^^ 3 ^^ Slgna! rat, ° DNAand normal DMA Is shown aJo^ metength of the 

The bcki curve in the ratio profile represents a mean of four chromosomes and Is surrounded by thin curves Nk^rZSd 

Stomal OMT^t!^ 8 f ^d the vertical linel nextto^^k^ anSof 

r^fl^Jl^' cJ TfT >s ^ es **** noninvasive tumor 335 used for comparison showed alterations In DNA content, the ratio 
tn^romosome is shown to the r^nf of the Invasive tumor profile. The colored bars represents one gene each WedHe 
S^T^^^S 8 ? T ^o^^canbe seen at www.MDLDK/sdata,htm^e bars In^emeT^ ^K 
■EffiiE^K ^ dlCat6 ^ Gxpressfon tevel of *• Sene m the Invasive tumor compared with the noninvasive ^SSS^S£ 

^ expression along the chromosome the cofors mdteate that at teast half of the genes were u^irtated ftbcfcL at least hatf^Xoer^ 
*"™>f>U (o/ueKor more than half of the genes are unchanged (orange). If a gene was j£sen7£ 

SX^"" 88 8 **" Chan ^ A 2 - foW ^ ^ ^ this corresponded to one starrfa^a^ 

determination of -1800 genes. Centromeres and heterochromatic regions were excluded from data analysis 



grade I and II, respectively, tumors 733 and 827 were staged as pT1 
(invasive Into submucosa), 733 was staged as solid, and 827 was 
staged as papillary, both grade III. 

mRNA Preparation -Tissue biopsies, obtained fresh from surgery, 
were embedded Immediately In a sodium-guanldlnlum thtocyanate 
solution and stored at -80 °a Total RIMA was isolated using the 
RNAzol B RNA isolation method (WAK-Cnemle Medical GMBH). 
poMA) + RNA was Isolated by an oflgo(dT) selection step (Oligotex 
mRNA kit; Qiagen). 

cRNA Preparation-! p,g of mRNA was used as starting material. 
The first and second strand cDNA synthesis was performed using the 
Superscripts choice system (Invltrogen) according to the manufac- 
turer's Instructions but using an oRgo(dT) primer containing a T7 RNA 
polymerase binding site. Labeled cRNA was prepared using the ME- 
GAscrip© In vitro transcription Wt (Amblon). Biotin-labeled CTP and 



UTP (Enzo) was used, together with unlabeled NTPs In the reaction. 
Following the in vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy columns (Qiagen). 

Array Hybridization and Scanning— Array hybridization and scary- 
nlng was modified from a previous method (13). 10 **g of cRNA was 
fragmented at 94 °C for 35 mln in buffer containing 40 rrtM Tris 
acetate, pH 8.1, 100 mw KOAc. 30 itim MgOAc. Prior to hybridization, 
the fragmented cRNAIn a 6x SSPE-T hybridization buffer (1 m NaCl| 
10 mM Trts, pH 7.6, 0.005% Triton), was heated to 95 °C for 5 min 
subsequently cooled to 40 °C. and loaded onto the Affymetrix probe 
array cartridge. The probe array was then Incubated for 16 h at 40 °C 
at constant rotation (60 rpm). The probe array was exposed to 10 
washes In 6X SSPE-T at 25 °C followed by 4 washes In 0>5x SSPE-T 
at 60 °C. The bioUnylated cRNA was stained, with a streptavldin- 
phycoerythrin conjugate, 10 /utg/ml (Molecular Probes) in 6x SSPE-T 
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Fig, 1— -continued 



for 30 mtn at 25 *C followed by 10 washes In 6x SSPE-T at 25 °C, The 
probe arrays were scanned at 560 am using a confocal laser scanning 
microscope (made for Affymetrix by Hewlett-Packard). The readings 
from the quantitative scanning were analyzed by Affymetrix gene 
expression analysis software, 

MIcrosatefGte Arw/^—Mtcrosateflite Analysis was performed as 
described previously (14). Microsateflttes were selected by use of 
wwwJWbUiIm.nfh.gov/genemap98, and primer sequences were ob- 
tained from the genome data base at wwwgdb.org. DNA was extracted 
from tumor and blood and amplified by PCR fri a volume of 20 jilfor35 
cycles. The ampOconswem denatured arx^ 

ABI Prism 377. Data were collected In the Gene Scan program for 
fragn^ analysis, loss of to 

of one allele detected In tumor arnpficons compared with blood. 

Pmteomic Anafyste—TCCz were minced Into small pieces and 
homogenized in a small glass homogenlzer In 0.5 ml of lysis solution. 
Samples were stored at -20 °C until use. The procedure for 2D gel 
electrophoresis has been described In detail elsewhere (15, 16). Gels 
were stained with silver nitrate and/or CoorhassJe Brilliant Blue. Pro- 
teins were Identified by a combination of procedures that Included 
microsequencing, mass spectrometry, two-dimensional gel Western 
Immunoblotting, and comparison with the master two-dimensional gel 
Image of human keratinocyte proteins; see biobase.dk/cgi-bln/cefls. 

CGH-Hybrfdlzation of differentially labeled tumor and normal DNA 
to normal metaphase chromosomes was perrormed as described 
previously (10). Ruorescein-labeled tumor DNA (200 ng) t Texas Red- 



labeled reference DNA (200 ng), and human Cot-1 DNA (20 jig) were 
denatured at 37 °C for 5 mln and applied to denatured normal met- 
aphase slides. Hybridization was at 37 °C for 2 days. After washing , 
the slides were counterstained with 0.15 pg/mt 4,6-c3amkiino-2-phe- 
rryiindole In an anti-fade solution. A second hybridization was per- 
formed for all tumor samples using fluoresceln-Jabeted reference DNA 
and Texas Red-labeled tumor DNA (inverse labeling) to confirm the 
aberrations detected during the initial hybridizatioa Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Red-labeled normal DNA, Digital image analysis was 
used to identify chromosomal regions with abnormal fluorescence 
ratios, indicating regions of DNA gains and losses. The average 
greercred fluorescence intensity ratio profiles were calculated using 
four Images of each chrc*ncsome (eight chromosomes total) with 
normalization of the greerured fluorescence Intensity ratio for the 
entire metaphase and background correction. Chromosome Identifi- 
cation was performed based on 4.6-dlamldino-2-phenyl?ndole band- 
ing patterns. Only Images showing uniform high Intensity fluores- 
cence with minimal background staining were analyzed All 
centromeres, p arms of acrocentric chromosomes, and heterochro- 
matic regions were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridization— The CGH analysis 
identified a number, of chromosomal gains and losses In the 
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Table I 

Correlation between afteraOohs detected by CQH and by expression monttoring 

Top, CGH used as Independent variable (if CGH alteration - what expression ratio was found); bottom, altered expression used as 
independent variable fit expression alteration - what CGH deviation was found). 



CGH alterations 



Tumor 733 vs. 335 
Expression change clusters 



Concordance CGH alterations 



Tumor 827 vs. 532 
Expression change clusters 



Concordance 



13 Gain 10 Up-regulation 

0 Down-regulation 

3 No change 
10 Loss 1 Up-regulation 

5 Down-regulation 

4 No change 



77% 
50% 



10Gain 
12 Loss 



8 Up-regulation 
0 Down-regulation 

2 No change 

3 Up-regulation 

2 Down regulation 
7 No change 



Expression change clusters 



Tumor 733 vs. 335 
CGH alterations 



Concordance Expression change clusters 



Tumor 827 vs. 532 
CGH alterations 



80% 
17% 

Concordance 



16 Up-regulation 
21 Down-regulation 
15 No change 



11 Gain 

2 Loss 

3 No change 
1 Gain 

8 Loss 

12 No change 
3 Gain 

3 Loss 

9 No change 



69% 
38% 
60% 



17 Up-regulation 
9 Down^regulatlon 
21 No change 



10 Gain 
5 Loss 

2 No change 

0 Gain 

3 Loss 
ONochange 

1 Gain 
3 Loss 

17 No change 



69% 
33% 
81% 



two invasive tumors (stage pT1, TCCs 733 and 827), whereas 
the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p- ( 9q22-q33-, and X-, and 7+, 9q-, 
and Y-, respectively. Both Invasive tumors showed changes 
(1q22-24+, 2q14.1-qter-, 3q12-q13.3-, 6q12-q22~, 
9q34-f , 11q12-q13+, 17+, and 20q11.2-q12+) that are typ- 
ical for their disease stage, as well as. additional alterations, 
some of which are shown in Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4-fold 
in the case of TCC 733 and 0.3-fold for TCC 827. The largest 
changes, amounting to at least a doubling of chromosomal 
content, were observed at 1q23 In TCC 733 (Fig, 1/*) and 
20q12 In TCC 827 {Fig. 18). 

tnRNA Expression In Relation to DNA Copy Number— The 
mRNA levels from the two invasive tumors (TCCs 827 and 
733) were compared with the two non-invasive counterparts 
(TCCs 532 and 335). This was done In two separate experi- 
ments in which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1,800 genes that yielded a signal on the arrays 
were searched in the Unlgene and Genemap data bases for 
chromosomal location, and those with a known location 
(1096) were plotted as bars covering their purported locus. In 
that way it was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the Indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level in 
the Invasive versus the non-invasive counterpart Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fold were regarded as informative (Fig. 1). The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the CGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
the chromosomal aberrations are not known at high resolution. 

Two sets of calculatibns were made from the data For the 
first set we used CGH alterations as the independent variable 
and estimated the frequency of expression alterations In these 
chromosomal areas. In general, areas with a strong gain of 
chromosomal material contained a cluster of genes having 
Increased mRNA expression. For example, both chromo- 
somes 1q21 -q25, 2p and 9q, showed a relative gain of more 
than 100% in DNA copy number that was accompanied by 
Increased mRNA expression levels In the two tumor pairs (Fig. 
1). In most oases, chromosomal gains detected by CGH were 
accompanied by an Increased level of transcripts In both 
TCCs 733 (77%) and 827 (80%) (Table I, top). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression in several cases, and were often. regis- 
tered as having unaltered RNA levels (Table I, fop). The Inabil- 
ity to detect RNA expression changes In these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

In the second set of calculations we selected expression 
alterations above 2-fold as the independent variable and es- 
timated the frequency of CGH alterations In these areas. As 
above, we found that Increased transcript expression corre- 
lated with gain of chromosomal material (TCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected In areas with unaltered CGH ratios (Table I, bottom). 
Furthermore, as a control we looked at areas with no atter- 
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Tumor 827 versus 532 Tumor 733 versus 335 

Fra.2. Correlation between maximum CGH aberration and the ability to detect expression change by oligonucleotide array 
m0r V^ n ?* ^L^Tf?" I l 8h0wn 83 a ^erfcal -fold change In ratio between Invasive tumom 827 (A) and 733 (♦) and their non-Invasive 
counterparts 532 and 335. The expression change was taken from the Express/oh One to the right In Rg. 1. which depicts the resulting 
expression change for a given chromosomal region. At least half of the mRNAs from a given region have to be either up- or down-regulated 

S ?° red 83 expression chan 9 e - A" chromosomal arms in which the CGH ratio plus or minus one standard deviation was outside the 
ratio value of one were included. 



ation In expression. No alteration was detected, by CGH In 
most of these areas (TCC 733, 60% and TCC 827, 81%; see 
Table i, bottom). Because the ability to observe reduced or 
Increased mRNA expression clustering to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations In the 
regions showing CGH changes against the abUity to detect a 
change in mRNA expression as monitored by the oligonucleo- 
tide arrays (Rg. 2)fe>r both tumors TCC 733 (p < 0,015) and 
TCC 827 <p < 0.OGO03) a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
the DNA copy number) and alterations detected by the array 
based technology (Rg. ^ Similar data were obtained when 
areas with altered expression were used as independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated 1.6- to 2.0-fold (Table I, bottom) but mostly did 
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction In expres- 
sion level, which Is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
much larger extent. 

Mlcrosatetlfte-based Detection of Minor Areas of Loss- 
es—In TCC 733, several chromosomal areas exhibiting DNA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1, TCC 733 
chromosome 1q32, 2p21, and 7q21 and q32, 9q34, and 
10q22). To determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two microsatellites positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Rg. 3). Additionally, chromosome 2p In 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript increase/decrease/in- 
crease. Thus, for the areas showing Increased expression 
there was a correlation with the DNA copy number alterations 
(Fig. 1A). As indicated above, the mRNA decrease observed in 
the middle of the chromosomal gain was because of LOH, 
implying that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material. However, this cannot be detected with 
the resolution of the CGH method, 

In both TCC 733 and TCC 827, the telomeric end of chro- 
mosome 11p showed a normal ratio In the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two microsatellites (D11S1760, D11S922) 
positioned dose to MUC2, IGF2, and cathepsln D Indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression of mRNA observed in TCC 733 at 
chromosomes 3q24, 11p11, 12p12.2, 12q21.1, and 16q24 
and In TCC 827 at chromosome 11p15.5, 12p11, 15q11.2, 
and 18q12 was also examined for chromosomal losses using 
microsatellites positioned as close as possible to the gene loci 
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Fig. 3. Microsateilite analysis of loss of heterozygosity. Tumor 
733 showing toss of heterozygosity at chromosome 1q25, detected 
(a) by D1S215 close to Hu class I WstocompatiWIity antigen (gene 
number 38 .In Fig. 1), (6) by D1S2735 dose to cathepsln E (gene 
number 41 In Fig. 1), and (c) at chromosome 2p23 by D2S2251 close 
to general 0-spectrin (gene number 1 1 on Fig. 1) and of (d) tumor 827 
showing toss of heterozygosity at chromosome 18q12 by S18S1118 
close to mitochondrial 3-^xoacyl-coenzyme A thtotase (gene number 
12 In Rg. 1). The upper curves show the electropherogram obtained 
from normal DNA from leukocytes (N). and the tower curves show the 
electropherogram from tumor DNA (7). In ail cases one allele is 
partially lost In the tumor arnpllcon. 

showing reduced mRNA transcripts. Only the microsateilite 
positioned at 18q12 showed LOH (Fig. 3), suggesting that 
transcriptional down-regulation of genes In the other regions 
may be controlled by other mechanisms. 

Relation between Changes in mRNA and Protein Levels- 
2D-PAGE analysis, in combination with Coomassle Brilliant 
Blue and/or silver staining, was carried out on all four tumors 
using fresh biopsy material, 40 well resolved abundant known 
proteins migrating In areas away from the edges of the pH 
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Fig. 4. Correlation between protein levels as Judged by 2D* 
PAGE and transcript ratio. For comparison proteins were divided in 
three groups, unaltered in level or up- or down-regulated {frorizonte! 
axis). The mRNA ratio as determined by oligonucleotide arrays was 
plotted tor each gene {vertical exfc). A. mRNAs that were scored as 
present In both tumors used for the ratio calculation; A, mRNAs that 
were scored as absent In the Invasive tumors (along horizontal axis) or 
as absent In non-invasive reference {top of figure). Two different 
scaflngs were used to exclude scaling as a confounder, TCCs ©27 
and 532 (AA) were scaled with background suppression, and TCCs 
733 and 335 (#0) were scaled without suppression. Both compari- 
sons showed highly significant (p < 0,005) differences in mRNA ratios 
between the groups. Proteins shown were as follows: Group A (from 
fefl), phosphoglucomutase 1, glutathione transferase class & number 
4, fatty acid-binding protein homologue, cytokeratln 15, and cyto- 
keratin 13; 6 (from fe/f), fatty acid-binding protein homologue, 28-kDa 
heat shock protein, cytokeratln 13, and cafcycUn; C(&om left), <r-eno~ 
lase, hnRNP B1, 28-kDa heat shock protein, 14-3-3-e, and 
pre-mRNA splicing factor; O t mesothelial keratin K7 (type II); B (from 
top), glutathione S-transferase-tr and mesotheflaJ keratin K7 (lyjie.ll); 
F(from top and left), adenytyi cyciase-associated protein, E-cadherin, 
keratin 19, calgizzarin, phosphoglycerate mutase, annexln IV, cy- 
toskeletai y-actin, hnRNP At, integral membrane protein calnexin 
(IP90), hnRNP H, brain-type clathrin light chain-a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear ribonucleoprotein A/B, 
translattonally controlled tumor protein, fiver glyceraldertyde-3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na,K- 
ATPase 0-1 subunif, G, (from top and feff), TCP20, calgizzarin, 70- 
kDa heat shock protein, calnexin. hnRNP H, cytokeratto 15, ATP 
synthase, keratin 1 9, trlosephosphate teomerase, hnRNP F, fiver glyc- 
eraldehyde-3-phosphatase dehydrogenase, glutathione S-transfer- 
ase-ir, and keratin 8; H (from left), plasma getsoTln, autoantlgen cal- 
reticufin, thtoredoxin, and NAD+-dependerrt 15 hydroxyprostaglandin 
dehydrogenase; / prom top), prolyl 4-hydroxytase 0-subunit, cyto- 
keratln 20, cytokeratin 17, prohibition, and fructose 1,6-biphos- 
phatase; J annexln U; K, annexln IV; L (from top and left), 90-kDa heat 
shock protein, prolyl 4-hydroxylase /J-eubunlt, o-enolase, GRP 78, 
cydophllln, and cofifln. 

gradient, and having a known chromosomal location, were 
selected for analysis In the TCC pair 827/532. Proteins were 
Identified by a combination of methods (see "Experimental 
Procedures"). In general there was a highly significant corre- 
lation (p < 0,005) between mRNA and protein alterations (Rg. 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except for a group of cyto- 
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keratins encoded by genes on chromosome 17 (Rg. 6) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal 
location were detected in TCCs 733 and 335, and of these 1 9 
correlated (p < 0.005) with the mRNA changes detected using 
the arrays (Fig. 4). For example, PA-FABP was highly ex- 
pressed In the non-invasive TCC 335 but lost In the Invasive 
counterpart (TCC 733; see Rg. 5). The smaller number of 
proteins detected in both 733 and 335 Was because of the 
smaller size of the biopsies that were available. 

11 chromosomal regions where CGH showed aberrations 
that corresponded to the changes In transcript levels also 
showed corresponding changes In the protein level (Table II). 
These regions included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratins 17 and 20, annexlns II and IV, and the fatty 
acid-binding proteins PA-FABP and FBP1, Four of these pro- 
teins were encoded by genes In chromosome 17q, a fre- 
quently amplified chromosomal area In Invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content, having 
lost some chromosomal parts and gained others. The present 
study provides some evidence as to the effect of these gains 
and losses on gene expression in two pairs of non-invasive 
and invasive TCCs using high throughput expression arrays 
and proteomics, In combination with CGH. In general, the 
results showed that there is a clear individual regulation of the 
mRNA expression of single genes, which in some cases was 
superimposed by a DNA copy number effect In most cases, 
genes located in chromosomal areas with gains often exhib- 
ited increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fold change, 
thus being at the border of detection. In several cases, how* 

Table II 

Proteins whose expression level correlates with both mRNA and gene dose changes 



Protein Chromosomal location Tumor TCC CGH alteration Transcript alteration - Protein alteration 



Annexin 11 


1q21 


733 


Gain 


Abs to Pres* 


Increase 


Annexln IV 


2p13 


733 


Gain 


3.9-Fold up 


Increase 


Cytokeratln 17 


17q12-q21 


827 


Gain 


3.8-Fold up 


Increase 


Cytokeratin 20 


17q21.1 


827 


Gain 


5.6-Fold up 


Increase 


(PA-)FABP 


8q21.2 


827 


loss 


10-Fold down 


Decrease 


FBP1 


9q22 


827 


Gain 


2.3-Fold up 


Increase 


Plasma gelsolin 


9q31 


827 


Gain 


Abs to Pres 


increase 


Heat shock protein 28 


15q12-q13 


827 


Loss 


2.5-Fold up 


Decrease 


Prohibftin 


17q21 


827/733 


Gain 


3.7V2.5-Foid up* 


Increase 


Prolyl-4-hydroxyl 


17q25 


827/733 


Gain 


6.7V1. 6-Fold up 


Increase 


hnRNPBI 


7p15 


827 


toss 


2.5-Fold down 


. Decrease 



a Abs, absent; Pres, present. 

b In cases where the corresponding alterations were found in both TCCs 827 and 733 these are shown as 827/733. 




Fig. 5. Comparison of protein and transcript levels in invasive 
and non-invasive TCCs. The upper part of the figure shows a 2D gel 
{/eft) and the oligonucleotide array {right) of TCC 532. The red rectan- 
gles on the upper gel highlight the areas that are compared below. 
Identical areas of 2D gels of TCCs 532 and 827 are shown below. 
Clearly, cytokeratins 13 and 15 are strongly down-regulated In TCC 
827 (red annotation). The tile on the array containing probes for 
cytokeratin 15 is enlarged below the array (red arrow) from TCC 532 
and Is compared with TCC 827. The upper row of squares In each tile 
corresponds to perfect match probes; the tower row corresponds to 
mismatch probes containing a mutation (used for correction for un- 
specific binding}. Absence of signal is depicted as black, and the 
higher the signal the lighter the color. A high transcript level was 
detected In TCC 532 (6151 units) whereas a much lower level was 
detected in TCC 827 (absence of signals). For cytokeratin 13, a high 
transcript level was also present In TCC 532 (15659 units), and a 
much lower level was present in TCC 827 (623 units). The 2D gels at 
the bottom of the figure QefQ show levels of PA-FABP and adipocyte- 
FABP In TCCs 335 and 733 (invasive), respectively. Both proteins are 
down-regutated in the invasive tumor. To the right we show the array 
tiles for the PA-FABP transcript A medium transcript level was de- 
tected in the case of TCC 335 (1277 units) whereas very low levels 
were detected In TCC 733 (166 units). /£F. isoelectric focusing. 
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ever, an Increase or decrease in DNA copy number was arm and that the use of cDMmk^^ 

associated with cte novo occurrence or complete loss of tran- copy number changes will reach a resolution that can resolve 

script, respectively. Some of these transcripts could not be these changes, as has recently been proposed (2). The outlier 

detected In the non-invasive tumor but were present at rela- data were not more frequent at the boundaries of the CQH 

tively high levels In areas with DNA amplications In the inva- aberrations. At present we do not know the mechanism be- 

sive tumors (e.g. In TCC 733 transcript from cellular ligand of hind chromosomal aneuploldy and cannot predict whether 

annexin II gene (chromosome 1q21) from absent to 2670 chromosomal gains will be transcribed to a larger extent than 

arbitrary Units; In TCC 827 transcript from small proline-rich the two native alleles, A mechanism as genetic Imprinting has 

protein 1 gene (chromosome 1q12-q21.1) from absent to an impact on the expression level In normal cells and Is often 

1326 arbitrary units). It may be anticipated from these data reduced in tumors. However, the relation between imprinting 

that significant clustering of genes with an increased expres- and gain of chromosomal material is not known, 

sion to a certain chromosomal area indicates an increased We regard It as a strength of this Investigation that we were 

likelihood of gain of chromosomal material In this area. able to compare invasive tumors to benign tumors rather than 

Considering the many possible regulatory mecharilsrns act- to normal urothelium, as the tumors studied were biologically 

Ing at the level of transcription, it sieems striking that the gene very close and probably may represent successive steps In 

dose effects were so clearly detectable In gained areas. One the progression of bladder cancer. Despite the limited amount 

hypothetical explanation may lie In the loss of controlled of fresh tissue available It was possible to apply three different 

methyiation In tumor cells (17-19). Thus, it may be possible state of the art methods. The observed correlation between 

that In chromosomes with increased DNA copy numbers two DNA copy number and mRNA expression Is remarkable when 

or more alleles could be demethylated simultaneously leading one considers that different pieces of the tumbr biopsies were 

to a higher transcription level, whereas In chromosomes with used for the different sets of experiments. This indicate that 

losses the remaining allele could be partly methylated, turning bladder tumors are relatively homogenous, a notion recently 

off the process (20, 21). A recent report has documented a supported by CGH and LOH data that showed a remarkable 

ploldy regulation of gene expression In yeast, but In this case all similarity even between tumors and distant metastasis (1 0, 23). 

the genes were present in the same ratio (22), a situation that is In the few cases analyzed, mRNA, and protein levels 

not analogous to that of cancer cells, which show marked showed a striking correspondence although in some cases 

chromosomal aberrations, as well as gene dosage effects. we found discrepancies that may be attributed to translational 

Several CGH studies of bladder cancer have shown that regulation, post*transiational processing, protein degrada- 

some chromosomal aberrations are common at certain tion, or a combination of these. Some transcripts belong to 

stages of disease progression, often occurring in more than 1 undertranslated mRNA pools, which are associated with few 

of 3tumors. In pTa tumors, these Indude9p-,9q-,1q+,Y- translationally inactive ribosomes; these pools, however, 

(2, 6), and in.pT1 tumors, 2q-,11p-. 11q-, 1q+, 5p+, 8q+, seem to be rare (24). Protein degradation, for example, may 

17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here be very Important , in the case of polypeptides with a short 

showed similar aberrations such as 9p- and 9q22~q33- and half-life (e.g. signaling proteins). A poor correlation between 

9q- and Y-, respectively. Likewise, the two minimal invasive mRNA and protein levels was found in liver cells as deter- 

pT1 tumors showed aberrations that are commonly seen at mined by arrays and 2D-PAGE (25), and a moderate correla- 

that stage, and TCC 827 had a remarkable resemblance to the tion was recently reported by Ideker et al. (26) In yeast 

commonly seen pattern of losses and gains, such as 1q22-24 (Interestingly, our study revealed a much better correlation 

amplification (seen In both tumors), 1 1q14-q22 loss, the latter between gained chromosomal areas and Increased mRNA 

often linked to 1 7 q+ (both tumors), and 1q+ and 9p~. often levels than between loss of chromosomal areas and reduced 

linked to 20q+ and 11 q13+ (both tumors) (7-9). These ob- mRNA levels. In general, the level of CGH change determined 

servations indicate that the pairs of tumors used In this study the ability to detect a change in trartscripC) One possible 

exhibit chromosomal changes observed In many tumors, and explanation could be that by losing one allele the change In 

therefore the findings could be of general. Importance for mRNA level Is not so dramatic as compared with gain of 

bladder cancer. material, which can be rather unlimited and may lead to a 

Considering that the mapping resolution of CQH Is of about severalfold Increase In gene copy number resulting In a much 

20 megabases it is only possible to get a crude picture of higher Impact on transcript level. The latter would be much 

chromosomal instability using this technique. Occasionally, easier to detect on the expression arrays as the cut-off point 

we observed reduced transcript levels close to or inside re- was placed at a 2-fold level so as not to be biased by noise on 

glons with increased copy numbers. Analysis of these regions the array. Construction of arrays with a better signal to noise 

by positioning heterozygous mlcrosatellites as dose as pps- ratio may in the future allow detection of lesser than 2-fold 

sible to the locus showing reduced gene expression revealed alterations in transcript levels, a feature that may facilitate the 

loss of heterozygosity In several cases. It seems likely that analysis of the effect of loss of chromosomal areas on tran- 

multlpie and different events occur along each chromosomal script levels. 
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In eleven cases we found a significant correlation between 
DNA copy number, mRNA expression, aid protein level. Four 
of these proteins were encoded by genes located at a fre- 
quently amplified area In chromosome 17q.. Whether DNA 
copy number Is one of the mechanisms behind alteration of 
these eleven proteins is at present unknown and will have to 
be proved by other methods using a larger number of sam- 
ples. One factor making such studies complicated Is the targe 
extent of protein modification that occurs after translation, 
requiring Immunoidentlfication and/or mass spectrometry to 
correctly Identify the proteins In the gels. 

In conclusion, the results presented in this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on microarrays with many thousand radiation 
hybrid-mapped genes will increase the resolution and informa- 
tion derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived from genes with 
known locations, and 20 gel analysis to obtain information at 
the post-translational level, a clearer and mote developed un- 
derstanding of the tumor genome will be forthcoming. 
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ABSTRACT* 

Genetic changes underlie tumor progression and may lead to cancer- 
specific expression of critical genes. Over 110a publications have de-. 
; scribed die use of comparative genomic hybridization (CGH) to analyze 
thepattern of copy number alterations in cancer, but very few of the genes 
affected are known. Here, we performed high-resolution CGH analysis on 
cDNA micro arrays in breast cancer and directly compared copy number 
and mRNA expression levels of 13,824 genes to quantitate me impact of 
genomic changes on gene expression. We Identified and mapped the 
boundaries of 14 independent ampticons, ranging in size from 0.2 to 12 
Mb, Throughout the genome, bom high- and low-level copy number 
changes had a substantial Impact on gene expression, with 44% of the 
Wghly amplified genes showing overexpression and 10.5% of the highly 
pverexpressed genes being ampfified. Statistical analysis with random 
permutation tests Identified 270 genes whose expression levels across 14 
samples were systematically attributable to gene amplification. TJiese 
included most previously described amplified genes, in breast cancer and 
many novel targets for genomic alterations, including the BOXB7 gene, 
the presence of which in a novel amplicon at 17q2U was validated in 
10.2% of primary breast Cancers and associated with poor patient prog, 
nosls. In conclusion, CGH on cDNA microarrays revealed hundreds of 
novel genes whose overexpression is attributable to gene amplification. 
These genes may provide insights to the clonal evolution and progression 
of breast cancer end highlight promising therapeutic targets. 

INTRODUCTION 

Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biologically distinct catego- 
riesi some of which may explain the clinical behavior of the tumors 
(1-6). Despite this progress in diagnostic classification, the molecular 
mechanisms underlying gene expression patterns in cancer have re- 
mahe^ehisive, and the utility of gene expression profiling in the 
ideatifi^tion of speafip tfiti^ 

Accrjmulation of genetic defects is thought to underlie the clonal 
evolution" of cancer. Identification of the genes that mediate the effects 
of genetic changes may be important by highlighting transcripts mat 
are actively invol ved in tumor progression. Such transcripts and their 
encoded proteins would be ideal targets for anticancer therapies, as 
demonstrated by the clinical success of new therapies against ampli- 
fied oncogenes, such as ERBB2 mdEGFR p, 8), in breast cancer and 
omer solid tumors. Besides amplifications of known oncogenes, over 





Fig. 1. Impact of g«ne copy somber on global go* expression levels.,! percentage of 
ova- and unoere*|aes8e<fcgeaes (7 axis) according to co^ .Duinber^Bdos QC axUX 
Threshold values used for over- and unoVsrexpresafon were >2J84 (global upper 7% of 
the cDNA ratios) and <0.4826 (global lower 7% of the expression ratios). B> percentage 
of amplified and deleted genes according to expression ratios. Threshold values for 
amplification and deletion were >\.5 tnd <0.7. ' 
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20. recurrent regions of DNA amplification have been mapped in 
breast cancer by CGH 3 (9, 10). However, these amplicons are often 
large ami poorly defined, and their impact on gene expression remains 
unknown. 

We hypothesized that genome-wide identification of those gene 
expression changes that , are attributable to underlying gene copy 
number alterations would highlight transcripts that are. actively in- 
volved in the causation or maintenance of. the malignant phenotype. 
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number variation plays in breast cancer development and progression; 
and (b) identify and characterize those genes, whose mRNA expres- 



3 The abbreviations used are: CGH, comparative genomic hybridization; FISH, fluo- 
rescence In situ hybridization; RT-PCR, reverse transcriptiorvPCR. 
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sibn is most significantly associated with amplification of the corre- 
sponding genomictemplate. 

MATERIALS AND METHODS 

Breast Cancer Gffl lines. Fourteen breast cancer cell lines (BT-20 BT- 
474, HCC1428, Hs558t, MCF7, Mt)A-36l, MDA.436, MDA-453, MDA-468, 
SKBR-3, T-47D, UACC812, ZR-75-1, and ZR-75-30) wet* obtained from the 
Amencan Type Culture Collection (Manassas, VA). Cells were grown under 
reoarmnended culture conditions. Genomic DNA and rnRNA were isolated 
using standard protocols. 

Copy Number and Expression Analyses by cDNA Microarrays. The 
I>repawti<m and printing of the 13,824 cDNA clones on glass slides were 
performed as described (1H3). Of these clones, 1244 represented uncharac- 



were excluded t*om the analysis and were treated as missing values. The 
distributions of fluorescence ratios were used to define outpoints for increased/ 
decreased copy number. Genes with CGH ratio >1.43 (representing the upper 
5% of the CGH ratios across all expernnente) were considered to be afnplified, 
and genes with ratio <0;73 (representing the lower 5%) were considered to be 
deleted. 

Statistical Analysis of CGH and cDNA Mlcroarray Data. To evaluate 
the influence of copy number alterations on gene expression, we applied the 
following statistical approach. CGH and cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across an 14 cell lines were 
median centered For each gene, the CGH data were represented by a vector 
that was labeled 1 for amplification (ratio, >L43) and 0 for no amplification. 
Amplification was correlated with gene expression using the signal-to-noise 



jkhw^kju as aescnoeo iiHJ), Of these clones, 244 represented uncharac- . ™P«ncauon was correlated with gene expression using the signal 
terized expressed sequence tags, and the remainder corresponded to known^ 8 ^^ 08 Wc calculated a weight, w m for each gene as^bUpws: 
genes. CGH experiments on eDNA mjcroarrays were done as described (14, 
1 5). Briefly, 20 w of genomic DNA ftom breast cancer cell lines and normal 



Iwrrian WBQs were digested for 14.^18 h with Ahil md tool (Life Technol- 
ogies, Inc^ Rockville, MD) and purified by phenol/chloroform extraction. Six 
Mg of digested cell line DNAs were labeled with Cy3-dUTP (Amersham 
itanacia) and normal DNA with Cy5-dUTP (Amersham Pharmacia) using 
dip Bioprime Labeling kit (Life Technologies, Inc.). Hybridization (14, 15) and 
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where m glt a gl and o^, denote the means and SDs for the expression 
levels for amplified and ncruunplified cell lines, respecttveiy. To assess the 
Statistical significance of each weight, we performed 10,000 random perniu- 



mtJuEsT T u »~ . 1UV '>* "^"waaaon V% ™) ana. sumsucai signincance or each weight, we performed 10,000 random perrnu- 

posthybndization gashes (1 3) were done as. described. For the expression tations of the label vector. The frobability that a gene had a larger oHoua! 
anaryses, a standard reference (Universal Human Reference RNA; Strataiene, weight by random permutation than the original weight was denoted byT/L 
La Jolla, CA) was used in all experiments. Forty wrof reference RNA were low a (<0.05) indicates a strong association between gene expression and 

amplification. . 
Genomic Localization of c|)NA Clones and Amplicon Mapping. Each 



La Jolla,* CA) was used in all experiments. Forty u-g of reference RNA were 
labeled with ty3-dUTP and 3 3 i*g of test rnRNA with CyS^UTP, and the 
labeled cDNAs were hybridized on microarrays as described (1 3, i5). For both 
microarray analyses, a laser confocal scanner (Agilent Technologies, J>alo 
Alto* CA) was used b measure Ihe fluorescence intensities at the target 
locations using the DEARRAY software (16). After background subtraction, 
average mtensities at each clone in the test Irybridization were divided by the 
average fatensity of me .cones^nding clone in me control hybridization. For 
the copy number analysis, the ratios were normalized on the basis of the 



cDNA clone on the rnicroarray was assigned to a Unigene cluster using the 
Unigene Build 14.1* A database of genomic sequence alignment information 
for rnRNA sequences was created from the August 2001 freeze of the Uni- 
versity of California Santa Cruz's GoldenPath database. 7 The chromosome and 
bp positions for each cPNA clone were men retrieved by relating these date 
sets, Amplicons were defined as a CGH copy number ratio >L0 m at least two 



. . — wv ww » uuxuittuica on me oasis or me . sew, /unpucons were oennea as a UJH copy number ratio >2,0 in at least two 

distnbunon of ratios of all targets on the array and for the expression analysis adjacent clones m two or mc« cell lines or a CGH ratio >2.0 in at least three 

on the basis of 88 housekeeping genes, which were spotted four times onto the adjacent clones in a single cell line. The amplicon start and end positions were 

array. Low quabty measurements (/.e., copy number o^te wim mean reference . ; 

intensity <roo fluorescent units, and expression data with both test and 6,^^^^.,, . . . ' • . _ • 

reference intensity < IN fluoresce 7 lS a^ 
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Ttolcl Summary of independent amplicons in J 4 breast cancer cell lines by 
. " CGRmbroarray 



Location 



Start (Mb) 



lp13 
1421 
lq22 
3pH 

7g31 
7q32 

&0MI-gg2M3 
8q21J 

8q#3-8o24.I4 

8q24.22 

9pl3 

13<j22-<i31 . 
17qll 

17ql2-q21.2 

17q2U2-421.33 

17^2-^23.3 

17<J233-^24J 

19ql3 

20qll.22 

20ql3.12 

20ql3.12-ql3.13 

20ql3.2-ql3.32 



End (Mb) 



Size (Mb) 



13*79 
17192 
179.28 
71.94 
55,62 
125.73 
140.01 
86.45 
98.45 
129.88 
\S\2\ 
38.65 
77.15 
86.70 
29.30 
39.79 
5Z47 
63.81 
6953 
40.63 
.34.59 
44.00 
46.45 
51^32 



132.94 
177,25 
17937 
74.66. 
60.95 
13056 
140.68 
92.46 
103.05 
142.15 
I5Z16 
.39.25. 
8138 
87.62 
3a85 
42.80 
55.80 
69.70 
7459 
41.40 
35J5 
45.62 
49.43 
59J2 



.02 
33 
03 
2.7 
53 
52 
0.7 
6.0 
4.6 
123 
1.0 
0.6 
42 
0,9 
1.6 
3.0 
33 
5.9 
5.1 
0.8 
13 
1.6 
3.0 
. 7.8 



extended to include neighboring nonamplificd clones (ratio, <\S). The am- 
-peon size determination was partially dependent on local clone density. 

FISH. Dual-color interphase FISH to breast cancer cell lines was done as 
described (17). Bacterial artificial chromosome clone RPM-36iK8 was la- 
beled with SpectnimOrange (Vysis, Downers Grove, IL), and Spectrum- 
Orange-labeled probe for EGFR was obtained rrom Vysis. SpectrumGreen- 
labeled chromosome 7 and 17 centromere probes (Vysis) were used as a 
rererence. A tissue microarray containing 612 formalin-fixed, paraffin-embed- 
oed primary breast cancers (17) was applied in FISH analyses as described 
- 0 fi )' I** of these specimens was approved by the Ethics Committee of the 
University of Basel and by the NM. Specimens containing a 2-fold or higher 
mcrease is the number of test probe signals, as compared with corresponding 
centromere, signajs, in at least 10% of the tumor cells were considered to be 
amplified.. Survival analysis was performed using the Kaplan-Meier method 
and the log-rank test 

KT-PCR. The HOXB7 expression level was . determined relative to 
GAPDH. Reverse transcription and PCR amplification were performed using 
Access RT-PCR System (Promega Corp., Madison, Wl) with 10 ngofmRNA 
as a template. HOXB7 primers were 5'-OAGCAOAQ<3GACTCOOACTT-3 ' 
and 5'-GCOTCAOOTAGCOATTOTA0«-3'. 

RESULTS . • ~ ■ 

Global EfTect of Copy Number, on Gene Expression. 13,824 
arrayed cDNA clones were applied for analysis of gene expression 
and gene copy number (CGH microarrays) in 14 breast cancer cell 
lines. The results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of foe highly amplified 
transcripts (CGH ratio, >2.5) were oveiexpressed (£*., belonged to 
the global upper 7% of expression ratios), cotnpared with only 6% for 
genes with normal copy number levels (Fig. 1A). Conversely, 10.5% 
of the transcripts, with high-level expression (cDNA ratio, >10) 
showed, increased copy number (Fig. IB). Low-level copy number 
increases and decreases were also associated, with similar, although 
less dramatic, outcomes on gene expression (Fig. 1). 

Identification of Distinct Breast Cancer Amplicons. Base-pair 
locations obtained for 1 1,994 cDNAs (86.8%) were used to plot copy 
number changes as a function of genomic position (Fig. 2, Supple- 
ment Fig. A). The average spacing of clones throughout the genome 
was 267 kfc This nlgh-resolution mapping identified 24 independent 
breast cancer amplicons, spanning from 0.2 to-12 Mb of DNA (Table 
1). Several amplification sites detected previously by chromosomal 



CGH were validated, with lq21, 17ql2-q21^ 17q22-q2i, 20ql3 1 
and 20ql3.2 regions being most commonly amplified. FurtoermorV 
the boundaries of these amplicons were precisely delineated In ad' 
dition, novel amplicons were identified at 9pl3 (38.65-39 25 MKV 
and 17q2U (52,4^55,80 Mb). . ' °* 

Direct Identification of Putatiye Amplification target Genes. 
The cDNA/CGH microarray technique enables the direct correla- 
tion of copy number and expression data on a gene-by-gene basis 
throughout the genome. We directly annotated W^resolutiorj 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified genes in the MCF-7 breast cancer 
cell line at lpl3, 17q22-q23, and 20ql3 were highly overex- 
pressed. A view of chromosome 7 in. the lyIDA-468 cell line 
implicates EGFk as, the most highly bverexpressed and arriplified 
gene at 7pll-*>12 (Fig.. 3,4). In BT-474, the two known amplicons 
at 17ql2 ; and 17q22-q23 contained numerous highly overex- 
pressed genes (Fig. IB). In addition, several genes, including the 
homeobox genes HOXB2 and HOXB7, were highly amplified in a 
previously undescribed independent an^licpn.at. 17^21.3. HOXB7 
was systematically ampiified (as validated by. FliSW, Fig. 3B. inset) 
as weH as overexpressed (as verified:by RT-PCR, data not shown) 
in BT-474, OACC812, and 2R-7S-:30 cells. Furthermore, this novel 
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Fig. 1 Annotation of gene expression data on COT micioatray profiles, ii genes in ihe 
7pl 1*12 amphcon in the MDA-468 ceil line are highly expressed {red Hots) arid include 
the EGFR oncogene. B. several genes in the I7ql2,. 17q213, and 17q23 amplicons in die 
BT-474 breast cancer cell line are highly overexpressed (red) and include die HQXB7 
gene. The data labcU and color coding are as indicated for Fig. 2C Insets shoy 
chrornosonial CGH profiles for the corresponding chromosomes and validation of the 
Increased copy number by interphase FISH using EGFR (red) and chromosome 7 
ccntiotnerc probe (green) to MDA-468. Qf) and WAB7-specific probe (red) and chro- 
mosome 17 centromere (green) to BT-474 ceils (j5> 



QSNE EXPRESSION PATTERNS IN. BREAST GANGER • 

level copy aumber increase. I^w-level copy number gains and losses between JTO*B7 amplification and poor patient prognosis Overall 
also had a significant influence on expression levels of genes in the our results illustrate how the identification of genes activated!™ 
regions affected, but these effects were more subtle on a gene-by-gene gene amplification provides a powerful approach to M*hH*ht 
basis than those of high-level amplifications. However, the impact of genes with an important role in cancer as well as to prioritize and 
tow-level gains on the dysregulation of gene expression patterns in validate putative targets for therapy development 
cancer may be equally important if not more important than that of 
high-level amplifications. Aneuploidy and low-level gains and losses 
of chromosomal arms represent the most common types of genetic 
alterations in breast and other cancers and, therefore, have an influ- 
ence on many genes. Our results in breast cancer extend the recent 
studies on the. impact of aneuploidy on global gene expression pat- 

. terns in yeast cells, acute myeloid leukemia, and a prostate cancer 
model system (22-24). 

The CGH microarray analysis identified 24 independent breast 
cancer amplicons. We defined the precise boundaries for, many am- 

. plfcons detected previously by chromosomal CGH (9, 10, 25, 26) and 

, also discovered novel amplicons that had not been detected previ- 
ously, presumably because of (heir small size (only 1-2 Mb) or close 
proximity to other larger amplicons; One of these novel amplicons 
involved the homeobox gene region at 17q21 3 and led to the over- 
expression of the HOXB7 and HOXB2 genes. The homeodomain 
franscription factors are'lcnown to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 transfection induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
tumorigenicity and angiogenesis in breast cancer (29-32), The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexpressing tiOXBl in breast cancer and suggest mat 
HOXB7, contributes to tumor progression and confers an aggressive 
disease phenotype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification With poor prognosis 

: of the patients. " 

We carried out a systematic search to identify genes whose 
expression levels across all 14 cell lines were attributable to 
amplification status. Statistical analysis revealed 270 such genes 
(representing ~2% of all genes on the array), including not only 
previously , described amplified genes, such as HER-2, AfYC, 
EGFR, ribosomal protein s6 kinase, and AIB3 9 but also numerous 
novel genes such as NRAS-related gene (lp!3), syndecan-2 (Sq22), 
and bone morpkogentc protein (20ql3.1), whose activation by 
amplificatfoi may similarly promote breast cancer profession. 
Most of the 270; genet have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of them to be causally 
involved, it is intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biological insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

In summary, we. demonstrate application of cDNA microarrays 
to the analysis of both copy number and expression levels of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: (a) evidence of a 
prominent global influence of copy number changes on gene 
expression levels; (b) a high-resolution map of 24 independent 
amplicons in breast cancer, and (c) identification of a set of 270 
genes, the oveTexpression of which was statistically attributable to 
gene amplification: Characterization of a novel amplicon at 
17q21.3 implicated amplification and oyerexpression of the 
HOXB7 gene in breast cancer, including a clinical association 
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Microarray analysis reveals a major direct role of 
DNA copy number alteration in the transcriptional 
program of human breast tumors 

Comprehensive Cancer Center, Uni^slty o?Nor* O^-S^H^z!^ ™* *** ,Departmer1t <* Ge ^« ^ 
Ctorrtributed by Patrick 0. Brown, August 6, 2002 



Genomic DNA copy number alterations are key genetic events in 
the development and progression of human cancers. Here we 
report a genome-wide microarray comparative genomic hybrid- 
ization (array CGH) analysis of DNA copy number variation In 
a series of primary human breast tumors. We have profiled DNA 
copy number alteration across 6.691 mapped human genes, in 44 
predominantly advanced, primary breast tumors and 10 breast 
cancer ceO lines. White the overall patterns of DNA amplification 
and deletion corroborate previous cytogenetic studies, the high- 
resolution (gene-by-gene) mapping of ampiicon boundaries and 
the quantitative analysis of ampiicon shape provide significant 
Improvement In the localization of candidate oncogenes. Parallel 
microarray measurements of mRNA levels reveal the remarkable 
degree to which variation in gene copy number contributes to 
variation In gene expression in tumor cells. Specifically, we find 
that 62% of highly amplified genes show moderately or highly 
elevated expression, that DNA copy number influences gene ex- 
pression across a wide range of DNA copy number alterations 
(deletion, low. mid- and high-level amplification), that on average, 
a 2-foW change in DNA copy number is associated with a corre- 
sponding 1.5-fbld change In mRNA levels, and that overall, at least 
12% of an the variation in gene expression among the breast 
tumors Is directly attributable to underlying variation in gene copy 
number. These findings provide evidence that widespread DNA 
copy number alteration can lead directly to global deregulation of 
gene expression, which may contribute to the development or 
progression of cancer. 

Conventional cytogenetic techniques, including comparative 
genomic hybridization (CGH) (1), have led to the Identifi- 
cation of a number of recurrent regions of DNA copy number 
alteration in breast cancer cell lines and tumors (2-4). While 
f° me Zt£?* k&o™ <»ntain known or candidate oncogenes 
£|^ OFR J (SP 11 )* (8q24), CCND1 (Uql3), ERBB2 

[RBI (I3ql4) and TP53 (17pl3)], the relevant gene(s) within 
other regions (e^., gain of lq, 8q22, and 17q22-24, and loss of 
Sp) remain to be identified A high-resolution genome-wide 
map, delineating the boundaries of DNA copy number alter- 
ations in tumors, should facilitate the localization and identifi- 
cation of oncogenes and tumor suppressor genes in breast 
cancer. In this study, we have created such a map, usine 
array-based CGH (5-7) to profile DNA copy number alteration 
in a senes of breast cancer cell lines and primary tumors. 

An unresolved question is the extent to which the widespread 
pNA copy number changes that we and others have identified 
in breast tumors alter expression of genes within involved 
regions. Because we had measured mRNA levels in parallel in 
the same samples (8), using the same DNA microarrays, we had 
an opjwrtunity to explore on a genomic scale the relationship 
Detween DNA copy number changes and gene expression; From 



this analysis, we have identified a significant impact of wide- 
spread DNA copy number alteration on the transcriptional 
programs of breast tumors. 

Materials and Methods 

Tumors and Cell lines. Primary breast tumors were predominantly 
large (>3 cm), intermediate-grade, infiltrating ductal carcino- 
mas, with more than 50% being lymph node positive. The 
traction of tumor cells within specimens averaged at least 50%. 
Details of individual tumors have been published (8, 9Y and 
are summarized in Table 1, which is published as supporting 
information on the PNAS web site, www;pnas.org. Breast cancer 
cell lines were obtained from the American Type Culture 
Collection. Genomic DNA was isolated either using Qiagen 
genomic DNA columns, or by phenol/chloroform extraction 
followed by ethanol precipitation. 

DNA Labeling and Microarray Hybridizations. Genomic DNA label- 
ing and hybridizations were performed essentially as described 
l 1 ££kck et aL (7). with slight modifications. Two micrograms 
of DNA was labeled in a total volume of 50 microliters and the 
volumes of all reagents were adjusted accordingly. "Test" DNA 
(from tumors and cell lines) was f luorescently labeled (Cy5) and 
hybndized to a human cDNA microarray containing 6,691 
different mapped human genes (Le„ UniGene clusters). The 
"reference*' (labeled with Qr3) for each hybridization was nor- 
mal female leukocyte DNA from a single donor. The fabrication 
of cDNA microarrays and the labeling and hybridization of 
mRNA samples have been described (8). 

Data Analysis and Map Positions. Hybridized arrays were scanned 
on a GenePfr scanner (Axon Instruments, Foster Oty, CA), and 
fluorescence ratios (test/reference) calculated using SCANAlyze 
software (available at htt^://ranaJbLgov). Fluorescence ratios 
were normalized for each array by setting the average Jog 
fluorescence ratio for all array elements equal to 0* Measure- 
ments with fluorescence intensities more than 20% above back- 
ground were considered reliable. DNA copy number profiles 
that deviated significantly from background ratios measured in 
normal genomic DNA control hybridizations were interpreted as 
evidence of real DNA copy number alteration (see Estimating 
Significance of Altered Fluorescence Ratios in the supporting 
information). When indicated, DNA copy number profiles are 
displayed as a moving average (symmetric 5-nearest neighbors). 
Map positions, for arrayed human cDNAs were assigned by 



Abbreviation: CGH. comparative genomic hybridization. 

*To whom reprint requests should be addressed at Department of Pathology Stanford 
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IdenUfymg the starting position of the best and longest match of 
^I5 quen ? e reprc$cntedin corresponding UniGene 

SSS/? 0 * ^ Iden Path " S^omT assembly 

(http.//genome.ucsc.edu/; Oct 7, 2000 Freeze). For UniGene 
clusters represented by multiple arrayed elements, mean fluo- 
rescence ratios (for all elements representing the same UniGene 
cluster) are reported. For mRNA measurements, fluorescence 
ratios are mean-centered" reported relative to the mean 
ratio across the 44 tumor samples). The data set described here 
can be accessed in its entirety in the supporting information. 

Results 

We performed CGH on 44 predominantly locally advanced, 
primary breast tumors and 10 breast cancer cell lines, using 
ll /ST? containing 6,691 different mapped human 
genes (Fig. la; also see Materials and Methods for details of 
microarray hybridizations). To take foil advantage of the Im- 
proved spatial resolution of array CGH, we ordered (f luores- 

E£?,? r) thC 6 ' m cDNAs ^cording to the "Golden 
rath (http://genome.ucsc.edu/) genome assembly of the draft 
human genome sequences (11). In so doing, arrayed cDNAs not 
only ^ themselves represent genes of potential interest (e.g. 

S2?i* T°%?? ^ am P licons >' bu « P^tdc preeile 
genetic landmarks for chromosomal regions of amplification and 



12964 



www4»na^org/cgl/dol/iai073/pnaf.16247l99d 



deletion. Parallel analysis of DNA from cell lines containing 
efferent numbers of X chromosomes (Fig. lb), as we did before 
(7), demonstrated the sensitivity of our method to detect sinrfe- 

StoKJSr^^ <4?>XXX) ' % < 48 ^ D «)^r 
z^-fiold (49^CXXXX) gams (also see Fig. 5, which is published 
as supporting information on the PNAS web site). Fluorescence 
ratios were linearly proportional to copy number ratios, which 
were slightly imoerestimated, in agreement with previous ob- 
servations <7). Numerous DNA copy number alterations were 
evident in both the breast cancer cell lines and primary tumors 
(Fig. la), detected in the tumors despite the presence of euploid 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower in the tumor samples. DNA copy-number 
alterations were found in every cancer cell line and tumor, and 
on every human chromosome in at least one sample. Recurrent 
regions of DNA copy number gain and toss were readily iden- 
tifiable. For example r gains within lq, 8q, 17q, and 20q were 
l n a ^ Portion of breast cancer cell lines/tumors 
(90%/69%, 100%/47%, 100%/60%, and 90%/44%, respective- 
ly), as were losses within lp, 3p, 8p, and 13q (S0%/2A% 
80%/22% f 80%/22%, and 7M/^%cc^7!^ 
with published cytogenetic studies (refc. 2-4; a complete listing 
of gains/losses is provided in Tables 2 and 3, which are published 
as supporting information on the PNAS web site). The total 
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number of genomic alterations (gains and losses) was found to 
be significantly higher In breast tumors that were high grade (P » 
0.008), consistent with published CGH data (3), estrogen recep- 
tor negative (P « 0.04), and harboring TP53 mutations (P « 
0.0006) (see Table 4, which is published as supporting informa- 
tion on the PNAS web she). 

The improved spatial resolution of our array CGH analysis is 
illustrated for chromosome 8, which displayed extensive DNA 
copy number alteration in our series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously 
uncharacterized oncogene (Fig. 2a). The complexity of amplicbn 
structure is most easily appreciated in the breast cancer cell line 
SKBR3. Although a conventional CGH analysis of 8q in SKBR3 
identified only two distinct regions of amplification (12), we 
observed three distinct regions of high-level amplification (la- 
beled 1-3 in Fig. 2b). For each of these regions we can define the 



boundaries of the interval recurrently amplified in the tumors we 
examined; in each ease, known or plausible candidate oncogenes 
can be identified (a description of these regions, as well as the 
recurrently amplified regions on chromosomes 17 and 20, can be 
found in Figs. 6 and 7, which are published as supporting 
information on the PNAS web site). 

For a subset of breast cancer cell lines and tumors (4 and 37, 
resrx>ctivery), and a subset of arrayed genes (6,095), mRNA 
levels were quantitatively measured in parallel oy using cDNA 
microarrays (8). The parallel assessment of mRNA levels is 
useful in the interpretation of DNA copy number changes. For 
example, the highly amplified genes that are also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
con. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor cells. 

A strong influence of DNA copy number on gene expression 
is evident in an examination of the pseudocolor representations 
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of DNA copy number and mRNA levels for genes on chromo- 
some 17 (Fig. 3). The overall patterns of gene amplification and 
elevated gene expression are quite concordant; Le^ a significant 
fraction of highly amplified genes appear to be correspondingly 
highly expressed. The concordance between high-level amplifi- 
cation and increased gene expression is not restricted to chro- 
mosome 17. Genome-wide, of 117 high-level DNA amplifica- 
tions (fluorescence ratios >4, and representing 91 different 
pus), 62% (representing 54 different genes; see Table 5, which 
is published as supporting information on the PNAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-centered fluorescence ratios >2) f and 42% (rep- 
resenting 36 different genes) are found associated with compa- 
rably highly elevated mRNA levels (mean-centered fluorescence 
ratios >4). 

To determine the extent to which DNA deletion and lower- 
level amplification (in addition to high-level amplification) are 
also associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 cell lines and 37 tumors, across 6,095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low-, 
medium-, and high-level amplification (Fig. 4a). For both the 



breast cancer cell lines and tumors, average mRNA levels 
tracked with DNA copy number across all five classes, in a 
statistically significant fashion (P values for pair-wise Student's 
f tests comparing adjacent classes: cell lines, 4 x 10" 4 * 1 x 10" 49 , 
5 X tO- 5 , 1 x 10"H tumors, 1 x 10"* 3 , 1 x 10-»* 5 X 10~«*, 
1 x 10~ 4 ), A linear regression of the average log(DNA copy 
number), for each class, against average log(mRNA level) 
demonstrated that on average, a Mold change in DNA copy 
numberwas accompanied by 1.4- and 15 -fold changes in mRNA 
level for the breast cancer cell lines and tumors, respectively (Fig. 
4a, regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy num- 
ber and mRNA level, each across the 37 tumor samples (Fig. 4b) . 
The distribution of correlations forms a normal-shaped curve, 
but with the peak markedly shifted in the positive direction from 
zero. This shift is statistical^ significant, as evidenced in a plot 
of observed vs. expected correlations (Fig. 4c), and ref lects a 
pervasive global influence of DNA copy number alterations on 
gene expression. Notably, the highest correlations between DNA 
copy number and mRNA level (the right tail of the distribution 
in Fig. 4b) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation measured in mRNA levels among the 37 
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tumors that could be attributed to underlying variation in DNA 
copy number. From this analysis, we estimate that, overall, about 
7% of all of the observed variation in mRNA levels can be 
explained directly by variation in copy number of the altered 
genes (Fig. 4d). We can reduce the effects of experimental 
measurement error on this estimate by using only that fraction 
of the data most reliably measured (fluorescence intensity/ 
background >3); using that data, our estimate of the percent 
variation in mRNA revels directly attributed to variation in gene 
copy number increases to 12% (Fig. 4^). This still undoubtedly 
represents a significant underestimate, as the observed variation 
in global gene expression is affected not only by true variation in 
the expression programs of the tumor cells themselves, but also 
by the variable presence of non-tumor cell types within clinical 
samples. 

Discussion 

This genome-wide, array CGH analysis of DNA copy number 
alteration in a series of human breast tumors demonstrates the 
Usefulness of defining amplicon boundaries at high resolution 
(gene-by-gene), and quantitatively measuring amplicon shape, to 
assist in locating and identifying candidate oncogenes. By ana- 
lyzing mRNA levels in parallel, we have also discovered that 
changes in DNA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 



cell lines and tumors. Although the DNA microarrays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we are exarmning such a large fraction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we are likely underestimating the 
contribution of DNA copy number changes to altered gene 
expression, we believe our findings are likely to be generalizable 
(but would nevertheless still be remarkable if only applicable to 
this set of -6,100 genes). 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13). Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene expression in cancer cells. In 
agreement with bur findings, Phillips et aL (14) have shown that 
with the acquisition of tumorigenicity in an immortalized pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a statistically significant respective increase and 
decrease in the average expression level of involved genes. In 
contrast, Platzer et a!. (15) recently reported that in metastatic 
colon tumors only -4% of genes within amplified regions were 
found more highly (>2-fbld) expressed, when compared with 
normal colonic epithelium. Hiis report differs substantially from 
our finding that 62% of highty amplified genes in breast cancer 
exhibit at least 2-fold increased expression. These contrasting 
findings may reflect methodological differences between the 
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studies. For example, the study of Platzer et al (15) may have 
systematically under-measured gene expression changes. In this 
regard it Is remarkable that only 14 transcripts of many thousand 
residing within unamplified chromosomal regions were found to 
exhibit at least 4-fold altered expression in metastatic colon 
cancer. Additionally, their reliance on lower-resolution chromo- 
somal CGH may have resulted in poorly delimiting the bound- 
aries of high-complexity aropHcons, effectively overcaliing re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNA copy number alteration has 
a large, pervasive and direct effect on global gene expression 
patterns in breast cancer has several important implications, 
First, this finding supports a high degree of copy number- 
dependent gene expression in tumors. Second, it suggests that 
most genes are not subject to specific autoregulation or dosage 
compensation. Third, this finding cautions that elevated expres- 
sion of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in tumor- 
igenesis. to our study, fully 62% of highly amplified genes 
demonstrated moderately or highly elevated expression. Tnis 
highlights the importance of high-resolution mapping of ampli- 
con boundaries and shape {to identify the "driving** gene(s) 
within amplicons (16)], on a large number of samples,in addition 
to functional studies. Fourth, this finding suggests that analyzing 
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the genomic distribution of expressed genes, even within existing 
microarray gene expression data sets, may permit the inference 
of DNA copy number aberration, particularly aneuploidy (where 
gene expression can be averaged across large chromosomal 
regions; see Fig. 3 and supporting information). Fifth, this 
finding implies that a substantial portion of the phenotypic 
uniqueness (and by extension, the heterogeneity in clinical 
behavior) among patients' tumors may be traceable to underly- 
ing variation in DNA copy number. Sixth, this finding supports 
a possible role for widespread DNA copy number alteration in 
tumorigenesis.(17, 18), beyond the amplification of specific 
oncogenes , and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread imbalance in gene expression, might disrupt critical 
stochioraetric relationships in cell metabolism and physiology 
(e.g., proteosome, mitotic spindle), possibly promoting further 
chromosomal instability and directly contributing to tumor 
development or progression. Finally, our findings suggest the 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression in cancer. 
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FROM DNA TO RNA 

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency, allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment— most obviously by controlling 
the production of its RNA. 



Figure 6-3 Genes can be expressed 
with different efficiencies. Gene A is 
transcribed and translated much more 
efficiently than gene B.7his allows the 
amount of protein A In the cell to be 
much greater than that of protein B. 



Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA— 
the language of a nucleotide sequence. Hence the name transcription. 

like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6-4). It differs from 
DNA chemically in two* respects: (1) the nucleotides in RNA are 
ribonucleotides— that is, they contain the sugar ribose (hence the name ribonu- 
cleic acid) rather than deojcyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cytosine (C), it contains the base uracil (U) 
instead of the thymine (T) in DNA. Since U, like T, can base-pair by hydrogen* 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U). It is not uncommon, however, to find other types of base 
pairs in RNA: for example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6). As we see later in this chapter, the ability to fold into com- 
plex three-dimensional shapes allows some RNA molecules to have structural 
and catalytic functions. 



Transcription Produces RNA Complementary to 
One Strand of DNA 

All of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 
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Figure 6-89 Protein aggregates that cause human disease. (A) Schematic illustration of the type of 
conformational change In a protein that produces material for a cross-beta filament. (B) Diagram illustrating 
the self-infectious nature of the protein aggregation that is central to prion diseases. PrP is highly unusual 
because the misfblded version of the protein, called PrP*, induces the nfarmal PrP protein it contacts* to 
change Its conformation, as shown. Most of the human diseases caused by protein aggregation are caused by 
the overproduction of a variant protein that is especially prone to aggregation, but because this structure Is 
not infectious In this way, it cannot spread from one animal to another. (Q Drawing of a cross-beta filament 
a common type of pro tease-resistant protein aggregate found in a variety of human neurological diseases. 
Because the hydrogen-bond interactions in a P sheet form between polypeptide backbone atoms (see Figure 
3-9), a number of different abnormally folded proteins can produce this structure. (D) One of several 
possible models for the conversion of PrP to PrP*, showing the likely change of two a-helices into four 
p-sq-ands. Although the structure of the normal protein has been determined accurately, the structure of the 
infectious form is not yet known with certainty because the aggregation has prevented the use of standard 
structural techniques. (C, courtesy of Louise Serpell, adapted from M. Sunde et al J. Mo/. B/oL 273:729-739, 
1 997; D, adapted from S.B. Prusiner, Trends ftocftem. Sd 21 :482-487, 1 996.) 

animals and humans. It can be dangerous to eat the tissues of animals that con- 
tain PrP*, as witnessed most recently by the spread of BSE (commonly referred 
to as the "mad cow disease") from cattle to humans in Great Britain. 

Fortunately! in the absence of PrP*, PrP is extraordinarily difficult to convert 
to its abnormal form. Although very few proteins have the potential to misfold 
into an infectious conformation! a similar transformation has been discovered 
to be the cause of an otherwise mysterious "protein-only inheritance" observed 
in yeast cells. 

There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reac- 
tions are required to produce a properly folded protein from the information 
contained in a gene (Figure 6-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with which each of the many steps 
is performed. 

We discuss in Chapter 7 that cells have the ability to change the levels of 
their proteins according to their needs. In principle, any or all of the steps in Fig- 
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Figure 6-90 The production of a 
protein by a eucaryotic cell. The final 
level of each protein In a eucaryotic cell 
depends upon the efficiency of each step 
depicted. 



ure 6-90) could be regulated by the cell for each individual protein. However, as 
we shall see in Chapter 7, the initiation of transcription is the most common 
point for a cell to regulate the expression of each of its genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block the very first step— the transcription of its DNA sequence into an RNA 
molecule. 



Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large ribonucleoprotein assembly called a ribosome. The 
amino acids used for protein synthesis are first attached to a family of tRNA 
molecules, each of which recognizes, by complementary base-pair interactions, par- 
ticular sets of three nucleotides in the mRNA (codons). The sequence of nucleotides in 
the mRNA is then read from one end to the other in sets of three according to the 
genetic code. 

To initiate translation, a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A 
large ribosomal subunit binds to complete the ribosome and begin the elongation 
phase of protein synthesis. During this phase, aminoacyl tRNAs—each bearing a 
specific amino acid bind sequentially to the appropriate codon in mRNA by forming 
complementary base pairs with the tRNA anticodon. Each amino acid is added to the 
C-terminal end of the growing polypeptide by means of a cycle of three sequential 
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Figure 7-5 Six steps at which 
eucaryotic gene expression can be 
controlled* Controls that operate at 
steps ! through 5 are discussed in. this 
chapter. Step 6, the regulation of protein 
activity, Includes reversible activation or 
inactlvation by protein phosphorylation 
(discussed In Chapter 3) as well as 
Irreversible inactlvation by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
a cell can control the proteins it makes by (1) controlling when and how often a 
given gene is transcribed (transcriptional control), (2) controlling how the RNA 
transcript is spliced or otherwise processed (RNA processing control), (3) 
selecting which completed mRNAs in the cell nucleus are exported to the cytosol 
and determining where in the cytosol they are localized (RNA transport and 
localization control), (4) selecting which mRNAs in the cytoplasm are translated 
by ribosomes (translational control), (5) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, inactivating, degrading, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (Figure 7-5). 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous interme- 
diates. In the following sections we discuss the DNA and protein components 
that perform this function by regulating the initiation of gene transcription. We 
shall return at the end of the chapter to the additional ways of regulating gene 
expression. 



Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A ceil typically expresses only a 
fraction of its genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals 
from other cells. Although all of the steps involved in expressing a gene can in prin- 
ciple be regulated, far most genes the initiation of RNA transcription is the most 
. important point of control 



DNA-BINDING MOTIFS IN GENE REGULATORY 
PROTEINS 

How does a cell determine which of its thousands of genes to transcribe? As 
mentioned briefly in Chapters 4 and 6, the transcription of each gene is con- 
trolled by a regulatory region of DNA relatively near the site where transcription 
^lins. Some regulatory regions are simple and act as switches that are thrown 
a single signal. Many others are complex and act as tiny microprocessors, 
responding to a variety of signals that they interpret and integrate to switch the 
neighboring gene on or off. Whether complex or simple, these switching devices 
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occur in the germ line, the cell lineage that gives rise to sperm or eggs. Most of 
the DNA in vertebrate germ cells is inactive and highly methylated. Over long 
periods of evolutionary time, the methylated CG sequences in these inactive 
regions have presumably been lost through spontaneous deamination events 
that were not properly repaired. However promoters of genes that remain active 
in the germ cell lineages (including most housekeeping genes) are kept 
unmethylated, and therefore spontaneous deaminations of Cs that occur with- 
in them can be accurately repaired. Such regions are preserved in modern day 
vertebrate cells as CG islands. In addition, any mutation of a CG sequence in the 
genome that destroyed the function or regulation of a gene in the adult would be 
selected against, and some CG islands are simply the result of a higher than nor- 
mal density of critical CG sequences. 

The mammalian genome contains an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands often provides a convenient way of identify- 
ing genes in the DNA sequences of vertebrate genomes. 

Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many 
specialized animal cells can maintain their unique character through many cell 
division cycles and even when grown in culture, the gene regulatory mechanisms 
involved in creating them must be stable once established and heritable when the 
cell divides. These features endow the cell with a memory of its developmental history. 
Bacteria and yeasts provide unusually accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a competitive interac- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other, this can create a flip-flop switch that switches a cell between two alternative 
patterns of gene expression. Direct or indirect positive feedback loops, which enable 
gene regulatory proteins to perpetuate their own synthesis, provide a general mech- 
anism for cell memory Negative feedback loops with programmed delays form the 
basis for cellular clocks. 

In eucaryotes the transcription of a gene is generally controlled by combinations 
of gene regulatory proteins. It is thought that each type of cell in a higher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type of cell A given gene regu- 
latory protein may be active in a variety of circumstances and typically is involved 
in the regulation of many genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also used by eucaryotic cells to regulate gene expression. An espe- 
cially dramatic case is the inactivation of an entire X chromosome in female mam- 
mals. In vertebrates DNA methylation also functions in gene regulation, being used 
mainly as a device to reinforce decisions about gene expression that are made ini- 
tially by other mechanisms. DNA methylation also underlies the phenomenon of 
genomic imprinting in mammals, in which the expression of a gene depends on 
whether it was inherited from the mother or the father. 



POSTTRANSCRJPTIONAL CONTROLS 

In principle, every step required for the process of gene expression could be 
controlled. Indeed, one can find examples of each type of regulation, although 
any one gene is likely to use only a few of them. Controls on the initiation of 
gene transcription are the predominant form of regulation for most genes. But 
other controls can act later in the pathway from DNA to protein to modulate 
the amount of gene product that is made. Although these posttranscriptional 
controls, which operate after RNA polymerase has bound to the gene's promoter 
and begun RNA synthesis, are less common than transcriptional control, for 
niany genes they are crucial. 
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Figure 7-86 A mechanism to explain 
both the marked overall deficiency 
of CG sequences and their clustering 
into CG islands in vertebrate 
genomes. A black fine marks the location 
of a CG dinucleotide In the DNA 
sequence, while a red "lollipop" indicates 
the presence of a methyl group on the 
CG dinucleotide. CG sequences that lie in 
regulatory sequences of genes that are 
transcribed in germ cells are unmethylated 
and therefore tend to be retained in 
evolution. Methylated CG sequences, on 
the ofher hand» tend to be lost through 
deamination of 5-methyl C toT, unless the 
CG sequence is critical for survival. 
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HER-2/neu Breast Cancer Predictive Testing 

Julie Smtford Hanna, Php. and Dm Mornfn, MJ>. 

Each year, over 1 82,0Q0 women in the Unite* States are recurrence risk in women younger than 40 years of aee fe, 

dhignosed with breast cancer, and approximately 45,000 die low- vefsus high-amplified tumors (54.5% compared Yil 

t sease - lnc,<tence appears to be increasing in the 85.7%); this is compared to a recurrence rate of 1 6 7% for 

J"??'*' * * rate ° f 2% Per year. The reasons patients with no HER-2/neu gene amplification/ HER.2Meu 

K~ J ^ B ; - ; .'. ^...^ai.:. ..^ ,. • .^-•*-*JP»" (si [cay tumor size.* 
J F^year survival rates taiu#fi^%pTOxiniatt^^ ' ThTchofce of methodology for determination of 

85%, depending on demographic group, with a significant neu status depends in part on raeclirical setting, FDA aoorovaJ 
percMttge of women experiencing recurrence of theircancer for the Vysis FISH test was granted based on cllniciltriZ 
withm 10 years of diagnosis. One of the factors most predic- involving) 1549 node-positive patients. Patients received™* 
tmforreenrrence once a diagnosis of breast cancer hasbeen of threelifferent treatments consisting of different doses erf- 
made is Ae number of axiUary lymph nodes to which tumor 'cyclophosphamide, Adriamycin, and 5-fluoronracil (CAF> 
has metastasized. Most noderposinve women are given adju- The study showed that patients with amplified KEk-l/tJu 
vant ■ therapy, which increases their Survival. However, 20%- benefited from treatment with higher doses of adriamycin- 

based *erapy,while those with-normal HER-2/neu levels did 
not The study, therefore identified, a sub-set of women, who 
because mey did not benefit from more aggressive treatment, 
did not need to be exposed to the associated side effects. In 
addition, other evidence indicates mat HER-2/neu amplifica- 
tion in node-negative patients can be used as an independent 
prognostic indicator for eafly recurrence, recurrent disease at 
any time and disease-related death.* Demonstration of HER- 

1 , - • 2meu gene, amplification by FISH has also been shown to be 

over-expression of the HER-2/neu oncogene have also been of value in predicting response to chemotherapy In stane-2 
identified as having value regarding treatment regimen and breast cancer patients, 
ptog^is^ ; . Selection of patieiitsfo^^ 

_ fflp-2/i$i (u%> knOwTas c-erbB2) ifan oncogene that clonal anto^lfimp^tolever; is based upon detheastm- 
encodes a transmembrane glycoprotein mat is homologous don of HEFU2/neu protein overexpression using HercepTest™ 
tojbut distinct from, the epidermal growth factor receptor. Studies using Herceptin 0 in patients with metastatic breast 
Numerous studies have indicated mat high levels of expres- cancer show an increase in time to disease progression 
won of this protein are associated with rapid tumor growth, increased response rate to chemotherapeutic agents and a small' 
certain forms of therapy resistance, and shorter disease-free increase in overall Survival rate. The FISH assays have not yet 
survival. The gene has been shown to be amplified and/or been approved for this purpose, and studies lookme' afresDonse 
ovorexpjessed in 10%-30% of invasive breast cancers and m to Herceptin* in patients with or without gene amplification 
40%^o% of intraductal breast carcinoma.' .status determined by FISH are in progress. 

t I J?^ are ^^ 8tort ^^^^ me,hod8by w ' lich ,n «enetal,nSH and IHCresults correlate well However 

HER-2/neu status can be evaluated: immunohistochemistry subsets of tumors are found which show discordant results' 

(JHC, HercepTest™) and FISH (fluorescent in situ hybridiza- U., protein overexpression without gene amplification or lack 

tion, PathVysion™ Kit). Both methods can be performed on of protein overexpression with gene amplification The tHnl- 

archrved and current specimens. The first method allows visual cal significance of such results is unclear. Based on the above 
assessment of the amount of HER-2/neu protein present on . considerations, HER-2/neu testing at SHMC/PAML will utl* 

tite cell membrane. The latter method, allows direct quauttfi- lize immunohistochemistry (HercepTest 0 ) as a screen, fol- 

cation of the level pf gene amplification present In the tumor, lowed by FJSH in IHC-negative cases. Alt^tfvely either 

enabling differentiation between low- versus hjgh-amplifica- method may be ordered mdrvidually depending on the clini- 

tidh. At least one study has demonstrated a difference in cal setting or clinician preference. 
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30% -patients without axillary node involvement also 
develop xwinent disease^anil the difficulty lies in how to iden- 
tify this high-risk subset of patients, These patients could 
benefit from incensed surveillance, early intervention, and 
treatment 

Prognostic markers currently used in breast cancer recur- 
rence prediction include tumor size, histological grade, steroid 
hormone receptor status, DNAploidy, proliferative index, and 
cathepsin D status. Expression of growth fector receptors and 
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CPT code information 

HER-2/neu via IHC 

88342 excluding interpretive report) 

BER-2/neu via FISH 2 

S&71 *2 Molecular cytogenetics, DNA probe, each 
88274 Molecular cytogenetics, interphase in situ hybrid- 
isation, analyze 25-99 cells 
88291 Cytogenetics and molecular cytogenetics, interpre- 
tation an<j report 

Procedural Information 

Imbaunohistochemistiy is performed using the FDA-approVed 
DAKO antibody kit* Herc^)testP. The DAKO kit contains 
reagents required to complete a two-step imnronohisto- 
chemical staining procedqre for routinely processed, parafBn- 
«nbed4ed sf&^^.FoU 

rabbft antibodty to human HER-i/ncu protein, the kit employs 
a ready-to-use dextran-based visualization reagent This re- 
agent consists of both secondary goat anti-rabbit antibody 
molecules with horseradish peroxidase molecules linked to a 
common dextran polymer backbone, thus eliminating the need 
. for sequential application of link antibody and peroxidase 
conjugated antibody- Enzymatic conversion of the subse- . 
quentfy added chromogen results in formation of visible 
reaction product at the antigen site* The specimen is then coon- 
terstained; a pathologist using light-microscopy totetprets . 
results, 

FISH analysis at SHMC/PAML is performed using the 
FDA-apprbved PathVysion™ HER-2/neu DNA probe kit, pro- 
duced by Vysis, Inc. Formalin fixed, paraffin-embedded breast 
tissue is processed using routine histological methods, and then 
slides are treated to aflow hybridization of DNA probes to the 
nuclei present in the tissue section. thePathvysion™ kit con- 
tains two direct-labeled DNA probes, one specific for the 
alphwd repetitive DNA (CEP 1 7, Spectnnn orange) present at 
ihe chwnfcspAe^f^^ere^and tfarse&rafrfbr the HER- 
ifaeu oncogene located at 17ql 1^-12 (q)e<rtrumpeen). Enu- 
meration of the probes allows a ratio of the number of copies 
of chromosome 17 to the number of copies of HBR-2/neu to 
be obtained; this enables quantification of low versus high 
amplification levels, and allows an estimate of the percentage 
of cells with HER-2/neu gene amplification. The clinically 
relevant distinction is whether the gene amplification is due 
to increased gene copy number on the two chromosome 17 
homologies normally present or an increase in the number of 
chromosome 17s in the cells. In the majority of cases, ratio 
equivalents less than 2.0 are indicative of a normal/negative 
result, ratios of 2.1 and over indicate that amplification is 
present and to what degree. Interpretation of this data will be 
performed and reported from the Vysis-certified Cytogenet- 
ics laboratory at SHMC. 
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